INFRARED MATRIX-ASSISTED LASER DESORPTION/IONIZATION MASS
SPECTROMETRIC ANALYSIS OF MACROMOLECULES 

RELATED APPLICATIONS 

For U.S. purposes, this application is a continuation-in-part of U.S. 
application Serial No. 09/074,936, filed May 7, 1998, to Franz Hillenkamp, 
entitled "IR-MALDI Mass Spectrometry of Nucleic Acids Using Liquid Matrices."
Where permitted, the subject matter of this application is herein incorporated by
reference in its entirety.

FIELD OF THE INVENTION

The disclosed processes relate generally to the field of genomics, 
proteomics and molecular medicine, and more specifically to processes of using
infrared matrix assisted laser desorption-ionization mass spectrometry to
analyze, or otherwise detect the presence of or determine the identity of a
biological macromolecule.

BACKGROUND OF THE INVENTION

In recent years, the molecular biology of a number of human genetic
diseases has been elucidated by the application of recombinant DNA
technology. More than 3000 diseases are known to be of genetic origin
(Cooper and Krawczak, "Human Genome Mutations" (BIOS Publ. 1993)), 
including, for example, hemophilias, thalassemias, Duchenne muscular 
dystrophy, Huntington's disease, Alzheimer's disease and cystic fibrosis, as
well as various cancers such as breast cancer. In addition to mutated genes
that result in genetic disease, certain birth defects are the result of 
chromosomal abnormalities, including, for example, trisomy 21 (Down's 
syndrome), trisomy 13 (Patau syndrome), trisomy 18 (Edwards syndrome),
monosomy X (Turner's syndrome) and other sex chromosome aneuploidies such
as Klinefelter's syndrome (XXY).

Other genetic diseases are caused by an abnormal number of 
trinucleotide repeats in a gene. These diseases include Huntington's disease, 
prostate cancer, spinocerebellar ataxia 1 (SCA-1), Fragile X syndrome
(Kremer et al., Science 252:1711-14 (1991); Fu et al., Cell 67:1047-58 (1991);
Hirst et al., J. Med. Genet. 28:824-29 (1991)), myotonic dystrophy type I
(Mahadevan et al., Science 255:1253-55 (1992); Brook et al., Cell 68:799-808
(1992)), Kennedy's disease (also termed spinal and bulbar muscular atrophy; La
Spada et al., Nature 352:77-79 (1991)), Machado-Joseph disease, and
dentatorubral and pallidoluysian atrophy. The aberrant number of triplet repeats
can be located in any region of a gene, including a coding region, a non-coding
region of an exon, an intron, or a regulatory element such as a promoter. In 
certain of these diseases, for example, prostate cancer, the number of triplet 
repeats is positively correlated with prognosis of the disease. 

Evidence indicates that amplification of a trinucleotide repeat is involved 
in the molecular pathology in each of the disorders listed above. Although some
of these trinucleotide repeats appear to be in non-coding DNA, they clearly are 
involved with perturbations of genomic regions that ultimately affect gene 
expression. Perturbations of various dinucleotide and trinucleotide repeats 
resulting from somatic mutation in tumor cells also can affect gene expression 
or gene regulation.

Additional evidence indicates that certain DNA sequences predispose an 
individual to a number of other diseases, including diabetes, arteriosclerosis, 
obesity, various autoimmune diseases and cancers such as colorectal, breast, 
ovarian and lung cancer. Knowledge of the genetic lesion causing or 
contributing to a genetic disease allows one to predict whether a person has or
is at risk of developing the disease or condition and also, at least in some cases, 
to determine the prognosis of the disease. 

Numerous genes have polymorphic regions. Since individuals have any 
one of several allelic variants of a polymorphic region, each can be identified 
based on the type of allelic variants of polymorphic regions of genes. Such
identification can be used, for example, for forensic purposes. In other 
situations, it is crucial to know the identity of allelic variants in an individual. 
For example, allelic differences in certain genes such as the major 
histocompatibility complex (MHC) genes are involved in graft rejection or graft 
versus host disease in bone marrow transplantation. Accordingly, it is highly
desirable to develop rapid, sensitive, and accurate methods for determining the 
identity of allelic variants of polymorphic regions of genes or genetic lesions.

Several methods are used for identifying allelic variants or genetic
lesions. For example, the identity of an allelic variant or the presence of a 
genetic lesion can be determined by comparing the mobility of an amplified 
nucleic acid fragment with a known standard by gel electrophoresis, or by 
hybridization with a probe that is complementary to the sequence to be
identified. Identification can be accomplished, however, only if the nucleic acid
fragment is labeled with a sensitive reporter function, for example, a radioactive
(³²P, ³⁵S), fluorescent or chemiluminescent reporter. Radioactive labels can be
hazardous and the signals they produce can decay substantially over time.
Non-radioactive labels such as fluorescent labels can suffer from a lack of
sensitivity and fading of the signal when high intensity lasers are used. 
Additionally, labeling, electrophoresis and subsequent detection are laborious, 
time-consuming and error-prone procedures. Electrophoresis is particularly 
error-prone, since the size or the molecular weight of the nucleic acid cannot be
correlated directly to its mobility in the gel matrix because sequence specific
effects, secondary structures and interactions with the gel matrix cause 
artifacts in its migration through the gel. 

Applications of mass spectrometry in the biosciences have been reported 
(see Meth. Enzymol., Vol. 193, Mass Spectrometry (McCloskey, ed.; Academic
Press, NY 1990); McLafferty et al., Acc. Chem. Res. 27:297-386 (1994); Chait
and Kent, Science 257:1885-1894 (1992); Siuzdak, Proc. Natl. Acad. Sci., USA
91:11290-11297 (1994)), including methods for mass spectrometric analysis of
biopolymers (see Hillenkamp et al. (1991) Anal. Chem. 63:1193A-1202A) and
for producing and analyzing biopolymer ladders (see International Publ.
WO 96/36732; U.S. Patent No. 5,792,664).

Mass spectrometry has been used for the analysis of nucleic acids (see, 
for example, Schram, Mass Spectrometry of Nucleic Acid Components, 
Biomedical Applications of Mass Spectrometry 34:203-287 (1990); Crain, Mass 
Spectrom. Rev. 9:505-554 (1990); Murray, J. Mass Spectrom. Rev. 31:1203
(1996); Nordhoff et al., Mass Spectrom. Rev. 15:67-138 (1997); U.S. Patent
No. 5,547,835; U.S. Patent No. 5,605,798; PCT Application Publication No.
WO 94/16101; PCT Application Publication No. WO 96/29431).




WO 99/57318 



PCT/US99/10251

[...] mass spectrometry (MALDI-MS) [...] (1988)). MALDI [...] and ESI [...]
Nordhoff [...] nucleic acids [...] are very polar [...] difficult to volatilize
and, therefore, there has been a [...] of large nucleic acids [...] megaDalton
mass range ([...]; Chen et al., Anal. Chem. 67:[...] (1995)). Mass assignment
using ESI is [...] possible with an uncertainty of about 10% [...] mass
determined by ESI-MS [...] and a 120 nucleotide E. coli 5S rRNA [...].
Furthermore, ESI requires extensive sample purification.

[...] (i.e., crystalline) matrix [...] nucleic acid [...] biopolymer/matrix
mixture which [...] effecting desorption [...]. In addition, MALDI-MS has been
performed on polypeptides [...] as a matrix. [...] it was necessary to first
[...] matrix [...] (Berkenkamp et al. (1996) [...]) MALDI-MS [...] reported
[...] with limited sensitivity (i.e., at least [...]) [...] protein [...]






can tend to form adducts which broaden the peaks on the high mass side 
(Hillenkamp et al. (1995) 43rd ASMS Conference on Mass Spectrometry and
Allied Topics, p. 357). Furthermore, although IR-MALDI MS appeared to
provide increased mass resolution due to less metastable fragmentation as
compared to UV-MALDI MS, this decrease in metastable decay has been
reported to be accompanied by an increase in fragmentation. 

UV-MALDI-MS is limited in the size of biological macromolecules that can 
be analyzed. For example, it is difficult to analyze nucleic acid molecules much 
larger than about 100 nucleotides (100-mer) by UV-MALDI-MS. 

Accordingly, despite the effort to apply mass spectrometry methods to
the analysis of nucleic acid molecules, limitations remain due, in part, to
physical and chemical properties of nucleic acids. For example, the polar nature 
of nucleic acid biopolymers makes them difficult to volatilize. 

Analysis of large DNA molecules using UV-MALDI-MS has been reported
(Ross and Belgrader, Anal. Chem. 69:3966-3972 (1997); Tang et al., Rapid
Commun. Mass Spectrom. 8:727-730 (1994); Bai et al., Rapid Commun. Mass
Spectrom. 9:1172-1176 (1995); Liu et al., Anal. Chem. 67:3482-3490 (1995);
Siegert et al., Anal. Biochem. 243:55-65 (1997)). Based on these reports, it is
clear that analysis of nucleic acids exceeding 30 kDa in mass (approximately a
100-mer) by UV-MALDI-MS becomes increasingly difficult with a current upper
mass limit of about 90 kDa (Ross and Belgrader, Anal. Chem. 69:3966-3972
(1997)). The inferior quality of the DNA UV-MALDI spectra has been attributed
to a combination of ion fragmentation and multiple salt formation of the
phosphate backbone. Since RNA is considerably more stable than DNA under
UV-MALDI conditions, the accessible mass range for RNA is up to about
150 kDa (Kirpekar et al., Nucl. Acids Res. 22:3866-3870 (1994)).
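
As a rough check on the correspondence between about 30 kDa and a 100-mer, the following Python sketch estimates the mass of a single-stranded DNA from its length. The residue masses are commonly tabulated average values for the four deoxynucleotide residues; the function name and the assumption of equal base composition are illustrative only and are not taken from the cited reports.

```python
# Average masses (Da) of DNA nucleotide residues (nucleoside monophosphate minus water).
RESIDUE_MASS_DA = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}


def approx_ssdna_mass_da(n_nucleotides: int) -> float:
    """Estimate the mass of a single-stranded DNA of the given length.

    Assumes equal base composition and ignores terminal groups and
    counter-ions, so the result is only a rough estimate.
    """
    mean_residue = sum(RESIDUE_MASS_DA.values()) / len(RESIDUE_MASS_DA)
    return n_nucleotides * mean_residue


if __name__ == "__main__":
    for length in (100, 300):
        print(length, "nt ->", round(approx_ssdna_mass_da(length) / 1000, 1), "kDa")
```

Under these assumptions a 100-mer comes out near 31 kDa, in line with the approximate figure given above.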

Nucleic acids in solid matrices (mostly succinic acid and, to a lesser 
extent, urea and nicotinic acid) have been analyzed by IR-MALDI (Nordhoff et
al., Rapid Commun. Mass Spectrom. 6:771-776 (1992); Nordhoff et al., Nucl.
Acids Res. 21:3347-3357 (1993); Nordhoff et al., J. Mass Spec. 30:99-112
(1995)). Nordhoff et al. (1992) initially reported that a 20-mer of DNA and an
80-mer of RNA were about the uppermost limit for resolution. Nordhoff et al.

particularly glycerol, that can form a glass or vitreous solid. IR-MALDI
with this liquid matrix can be employed in any method, particularly
diagnostic methods and sequencing methods, heretofore performed with
UV-MALDI. Such methods, particularly diagnostic methods for nucleic acids and
proteins, include, but are not limited to, those described in U.S. Patent Nos.
5,547,835, 5,691,141, 5,605,798, 5,622,824, 5,777,324, 5,830,655, 
5,700,642, allowed U.S. application Serial Nos. 08/617,256, 08/746,036, 
08/744,481, 08/744,590, 08/647,368, published International PCT application 
Nos. WO 96/29431, WO 99/12040, WO 98/20019, WO 98/20166,
WO 98/20020, WO 97/37041, WO 99/14375, WO 97/42348, WO 98/54751
and WO 98/26095. 

In practicing an embodiment of the method for nucleic acid analyses, a 
composition for IR-MALDI containing the nucleic acid and a liquid matrix is 
deposited onto a substrate, which, generally, is a solid support, to form a
homogeneous, transparent thin layer of nucleic acid mixture. This mixture is
illuminated with infrared radiation so that the nucleic acid solution is desorbed 
and ionized, thereby emitting ion particles, which are analyzed using a mass 
analyzer to determine the mass of the nucleic acid. Preferably, sample 
preparation and deposition are performed using an automated device. 

Methods for detecting the presence or absence of a biological
macromolecule in a sample using IR-MALDI mass spectrometry are also provided
herein. In a particular embodiment, a composition for IR-MALDI containing the 
biological macromolecule and a matrix is illuminated with infrared radiation, 
desorbed and ionized, thereby emitting ion particles, which are analyzed to
determine whether the biological macromolecule is present.

Methods for detecting the presence or absence of a nucleic acid in a 
sample using IR-MALDI mass spectrometry are also provided herein. In a 
particular embodiment, a composition for IR-MALDI containing the sample and a 
liquid matrix is illuminated with infrared radiation, desorbed and ionized, thereby
emitting ion particles which are analyzed to determine whether the nucleic acid
is present.

Liquid matrices for use in the processes disclosed herein have a
sufficient absorption at the wavelength of the laser to be used in performing
desorption and ionization and are a liquid at room temperature (20°C), and can
form a glass solid. The liquid is intended to be used in any IR MALDI
format and at any temperature, typically about -200°C to 80°C, preferably
-60°C to about 40°C, suitable for such formats.

For absorption purposes, the liquid matrix can contain at least one
chromophore or functional group that strongly absorbs infrared radiation.
Preferred functional groups include nitro, sulfonyl, sulfonic acid, sulfonamide,
nitrile or cyanide, carbonyl, aldehyde, carboxylic acid, amide, ester, anhydride,
ketone, amine, hydroxyl, aromatic rings, dienes and other conjugated systems.

Among the preferred liquid matrices are substituted or unsubstituted
(1) alcohols, including glycerol, sugars, polysaccharides, 1,2-propanediol,
1,3-propanediol, 1,2-butanediol, 1,3-butanediol, 1,4-butanediol and triethanolamine;
(2) carboxylic acids, including formic acid, lactic acid, acetic acid, propionic
acid, butanoic acid, pentanoic acid and hexanoic acid, or esters thereof;
(3) primary or secondary amides, including acetamide, propanamide,
butanamide, pentanamide and hexanamide, whether branched or unbranched;
(4) primary or secondary amines, including propylamine, butylamine,
[...] and dipropylamine; and
(5) nitriles, hydrazine and hydrazide. The liquids do not crystallize, but rather
form a glass or vitreous phase when subjected to [...] or other
condition leading to a transition from the liquid phase. Materials of relatively
low volatility are preferred to avoid rapid evaporation under conditions of
vacuum during the IR-MALDI processes.
confer one or more of the properties described above. Such mixtures can
contain two liquid matrix materials (i.e., binary mixtures), three (ternary
mixtures) or more.

A nucleic acid/matrix composition for IR-MALDI is deposited as a thin
layer on a substrate, which preferably is contained within a vacuum chamber.
Preferred substrates for holding the nucleic acid/matrix solution can be solid 
supports, for example, beads, capillaries, flat supports, pins or wafers, with or 
without filter plates. Preferably the temperature of the substrate can be 
regulated to cool the nucleic acid/matrix composition to a temperature that is
below room temperature.

Preferred infrared radiation is in the mid-IR wavelength region from about 
2.5 µm to about 12 µm. Particularly preferred sources of radiation include CO,
CO₂ and Er lasers. In certain embodiments, the laser can be an optic fiber laser,
or the laser radiation can be coupled to the mass spectrometer by fiber optics. 
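
To relate the mid-IR wavelength range given above to the wavenumbers and photon energies in which infrared absorption bands are usually tabulated, the following Python sketch converts a wavelength in micrometres to both quantities. The function names are illustrative, and the two laser lines used in the example (2.94 µm for an Er:YAG laser and 10.6 µm for a CO₂ laser) are commonly quoted values that are assumed here and are not stated in the text.

```python
PLANCK_EV_S = 4.135667696e-15    # Planck constant in eV*s
LIGHT_SPEED_M_S = 299_792_458.0  # speed of light in m/s


def wavenumber_per_cm(wavelength_um: float) -> float:
    """Convert a wavelength in micrometres to a wavenumber in cm^-1."""
    return 1.0e4 / wavelength_um


def photon_energy_ev(wavelength_um: float) -> float:
    """Photon energy in eV for a wavelength given in micrometres."""
    return PLANCK_EV_S * LIGHT_SPEED_M_S / (wavelength_um * 1.0e-6)


if __name__ == "__main__":
    # Band edges from the text plus two assumed, commonly quoted laser lines.
    lines = [("band start", 2.5), ("Er:YAG", 2.94), ("CO2", 10.6), ("band end", 12.0)]
    for label, um in lines:
        print(f"{label:10s} {um:5.2f} um  "
              f"{wavenumber_per_cm(um):7.1f} cm^-1  {photon_energy_ev(um):.3f} eV")
```

The 2.5 µm edge corresponds to 4000 cm⁻¹, and the 12 µm edge to roughly 833 cm⁻¹, which spans the absorption regions of hydroxyl, carbonyl and similar functional groups.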

In a further preferred embodiment, the ion particles generated by infrared
irradiation of the analyte in the liquid matrix are extracted for analysis by the
mass analyzer in a delayed fashion prior to separation and detection in a mass
analyzer. Preferred separation formats include linear or reflector, with linear and
nonlinear fields, for example, curved field reflectron; time-of-flight (TOF); single
or multiple quadrupole; single or multiple magnetic sector; Fourier transform ion
cyclotron resonance (FTICR); or ion trap mass spectrometers. 
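
For the time-of-flight format, the relation between flight time and mass-to-charge ratio can be sketched as follows. This is a minimal, idealized model of a linear instrument, assuming that every ion starts at rest, gains the full energy z·e·U, and then drifts through a field-free tube of length L; it does not model delayed extraction or a reflectron. The function names, the 20 kV accelerating voltage and the 1 m drift length are hypothetical illustration values.

```python
import math

ELEMENTARY_CHARGE_C = 1.602176634e-19  # coulombs
DALTON_KG = 1.66053906660e-27          # kilograms per dalton


def flight_time_s(mass_da, charge, accel_voltage_v, drift_length_m):
    """Idealized drift time of an ion accelerated from rest through accel_voltage_v."""
    mass_kg = mass_da * DALTON_KG
    energy_j = charge * ELEMENTARY_CHARGE_C * accel_voltage_v
    return drift_length_m * math.sqrt(mass_kg / (2.0 * energy_j))


def mass_to_charge_da(time_s, accel_voltage_v, drift_length_m):
    """Invert the relation above to recover m/z in daltons per elementary charge."""
    velocity_sq = (drift_length_m / time_s) ** 2
    return 2.0 * ELEMENTARY_CHARGE_C * accel_voltage_v / (velocity_sq * DALTON_KG)


if __name__ == "__main__":
    t = flight_time_s(30_000.0, 1, 20_000.0, 1.0)
    print(f"flight time: {t * 1e6:.1f} microseconds")
    print(f"recovered m/z: {mass_to_charge_da(t, 20_000.0, 1.0):.1f}")
```

Because the flight time scales with the square root of m/z, heavier ions arrive later, and a measured arrival time can be converted back to a mass once the instrument is calibrated.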

Processes of using IR-MALDI mass spectrometry to identify the presence 
of a target nucleic acid in a biological sample are provided. Such a process can 
be performed, for example, by amplifying nucleic acid molecules in the
biological sample; contacting the amplified nucleic acid molecules with a
detector oligonucleotide, which can hybridize to a target nucleic acid sequence
present among the amplified nucleic acid molecules; preparing a composition for
IR-MALDI, by mixing the product of the reaction with a liquid matrix, which 
absorbs infrared radiation; and identifying duplex nucleic acid molecules in the
composition by IR-MALDI mass spectrometry, wherein the presence of duplex
nucleic acid molecules identifies the presence of the target nucleic acid in the 
biological sample.

A process for identifying the presence of a target nucleic acid sequence
in a biological sample also can be performed by amplifying nucleic acid 
molecules obtained from a biological sample; specifically digesting the amplified 
nucleic acid molecules using at least one appropriate nuclease, to produce 
digested fragments; hybridizing the digested fragments with complementary
capture nucleic acid sequences, which are immobilized on a solid support and 
can hybridize to a digested fragment of a target nucleic acid to produce 
immobilized fragments; preparing a composition for IR-MALDI, containing the 
immobilized fragments and a liquid matrix, which absorbs infrared radiation; and 
identifying immobilized fragments by IR-MALDI mass spectrometry, thereby
detecting the presence of the target nucleic acid sequence in the biological 
sample. 

The presence of a target nucleic acid in a biological sample also can be 
identified by performing, on nucleic acid molecules obtained from the biological
sample, a first polymerase chain reaction using a first set of primers, which are
capable of amplifying a portion of the nucleic acid containing the target nucleic 
acid; preparing a composition containing the first amplification product and a 
liquid matrix, which absorbs infrared radiation; and detecting the first 
amplification product in the composition by IR-MALDI mass spectrometry,
thereby detecting the presence of the target nucleic acid in the biological
sample. If desired, such a process can include, prior to preparing the 
composition for IR-MALDI, performing a second polymerase chain reaction on 
the first amplification product using a second set of primers that can amplify at 
least a portion of the first amplification product containing the target nucleic
acid.

Also disclosed herein are compositions, particularly compositions for 
IR-MALDI, such compositions containing a biological macromolecule, which is 
suitable for analysis by IR-MALDI, and a liquid matrix, which absorbs infrared 
radiation. A biological macromolecule suitable for analysis by IR-MALDI can be, 
for example, a nucleic acid, a polypeptide or a carbohydrate, or can be a
macromolecular complex such as a nucleoprotein complex, protein-protein 
complex, or the like. A composition for IR-MALDI as disclosed herein generally
contains the biological macromolecule, for example, a nucleic acid, and the
liquid matrix in a ratio of about 10⁻⁴ to 10⁻⁹, and can contain less than about 10
picomoles of biological macromolecule to be analyzed, for example, about
100 attomoles to about 1 picomole (pmol) of the biological macromolecule. (For
proteins, the analyte to matrix ratio is typically narrower, ranging from about
2 x 10⁻⁴ to 2 x 10⁻⁵.) A composition for IR-MALDI as disclosed herein also can
contain an additive, which facilitates detection of the biological macromolecule
by IR-MALDI, for example, an additive that improves the miscibility of the
biological macromolecule in the liquid matrix. In one embodiment, a
composition for IR-MALDI is deposited on a substrate, which can be a solid
support such as a silicon wafer or other material providing a surface for 
deposition of a composition for IR-MALDI, for example, a stainless steel surface. 

Processes for characterizing a biological macromolecule by IR-MALDI 
mass spectrometry are provided. For example, the mass of a biological
macromolecule can be determined by preparing a composition for IR-MALDI
containing the biological macromolecule to be analyzed and a liquid matrix, 
which absorbs infrared radiation; then analyzing the biological macromolecule in 
the composition by IR-MALDI mass spectrometry, thereby allowing a 
determination of the mass of the biological macromolecule. 

A process as disclosed herein also can be used for detecting a target
biological macromolecule by preparing a composition for IR-MALDI containing
the target biological macromolecule and a liquid matrix, which absorbs infrared 
radiation, and performing IR-MALDI mass spectrometry on the composition to 
identify the target biological macromolecule in the composition, thereby
detecting the target biological macromolecule. If desired, the target biological
macromolecule can be present in or obtained from a biological sample.
Accordingly, a process for identifying the presence of a target biological
macromolecule in a biological sample is provided. The presence of a target
nucleic acid, for example, can be identified by preparing a composition for
IR-MALDI, containing a biological sample containing nucleic acid molecules (or
nucleic acid molecules isolated from the biological sample) and a liquid matrix, 
which absorbs infrared radiation; then analyzing the composition by IR-MALDI
mass spectrometry, wherein detection of a nucleic acid molecule having a
molecular mass of the target nucleic acid sequence identifies the presence of 
the target nucleic acid sequence in the biological sample. 
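
The decision described above, in which a peak at the molecular mass of the target identifies its presence, can be sketched as a simple tolerance test on a list of observed masses. The function names, the 0.1% relative tolerance and the example masses are hypothetical; in practice the tolerance would depend on the mass accuracy of the instrument used.

```python
def find_target_peaks(observed_masses_da, expected_mass_da, rel_tolerance=0.001):
    """Return the observed masses lying within rel_tolerance of the expected mass."""
    window_da = expected_mass_da * rel_tolerance
    return [m for m in observed_masses_da if abs(m - expected_mass_da) <= window_da]


def target_present(observed_masses_da, expected_mass_da, rel_tolerance=0.001):
    """True if at least one observed mass matches the expected target mass."""
    return bool(find_target_peaks(observed_masses_da, expected_mass_da, rel_tolerance))


if __name__ == "__main__":
    spectrum = [15230.4, 30891.0]  # hypothetical peak masses in Da
    print(target_present(spectrum, 30895.0))  # a peak lies within 0.1% of the target
    print(target_present(spectrum, 45000.0))  # no peak near this mass
```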

Also provided is a process of using IR-MALDI mass spectrometry to 
identify an individual having a disease or a predisposition to a disease by
detecting a characteristic of a biological macromolecule that is obtained from 
the individual and is associated with the disease or the predisposition. Such a 
process is particularly useful for identifying a genetic disease, or a disease 
associated with a bacterial infection, or a predisposition to such a disease, and 
also is useful for determining identity, heredity or compatibility. 

The processes disclosed herein are suitable for analyzing one or more 
target biological macromolecules, particularly a large number of target biological 
macromolecules, for example, by depositing a plurality of compositions, each 
containing one or more target biological macromolecules, on a solid support, for 
example, a chip, in the form of an array. The disclosed processes are 
particularly suitable for multiplex analysis of a plurality of biological 
macromolecules contained in a single composition, including a liquid matrix, in 
which case each biological macromolecule in the plurality can be differentially 
mass modified to facilitate multiplex analysis. Accordingly, the processes 
disclosed herein are readily adaptable to high throughput assay formats. 
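
For multiplex analysis, differential mass modification is useful only if the resulting masses can be told apart. The following Python sketch checks a set of expected masses for pairs that lie closer together than a chosen minimum separation, before and after mass tags are added. The labels, masses, tag value and the 50 Da separation threshold are all hypothetical illustration values.

```python
from itertools import combinations


def unresolved_pairs(expected_masses_da, min_separation_da):
    """Return pairs of labels whose expected masses lie closer than min_separation_da."""
    items = sorted(expected_masses_da.items())
    return [
        (label_a, label_b)
        for (label_a, mass_a), (label_b, mass_b) in combinations(items, 2)
        if abs(mass_a - mass_b) < min_separation_da
    ]


def apply_mass_tags(base_masses_da, tag_masses_da):
    """Add a per-analyte mass tag (0 if none is given) to each base mass."""
    return {
        label: mass + tag_masses_da.get(label, 0.0)
        for label, mass in base_masses_da.items()
    }


if __name__ == "__main__":
    base = {"probe_1": 6000.0, "probe_2": 6010.0, "probe_3": 6200.0}
    print(unresolved_pairs(base, 50.0))  # probe_1 and probe_2 overlap
    tagged = apply_mass_tags(base, {"probe_2": 100.0})
    print(unresolved_pairs(tagged, 50.0))  # no overlapping pairs remain
```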

Processes for obtaining information on a sequence of a nucleic acid 
molecule by determining the identity of a target polypeptide encoded by the 
nucleic acid molecule are provided. In practicing these methods, a target 
polypeptide (or mixture thereof) is prepared from a nucleic acid molecule
encoding the target polypeptide; the molecular mass of the target
polypeptide is determined by providing a mixture of the polypeptide with a liquid
matrix, or in some embodiments, with water or succinic acid, and performing
IR-MALDI. The identity of the target polypeptide is determined by comparing the
molecular mass of the target polypeptide with the molecular mass of a reference 
polypeptide of known identity. Information, such as the presence of a 
mutation, on a sequence of nucleotides in the nucleic acid molecule encoding 
the target polypeptide can thereby be obtained. 
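
The comparison of a target polypeptide mass with a reference mass can be illustrated by computing average masses from amino acid sequences. In this sketch the residue masses are commonly tabulated average values, while the two short sequences, the function names and the 5 Da tolerance are hypothetical and chosen only to show how a single amino acid substitution appears as a mass shift.

```python
# Average residue masses (Da) of the 20 standard amino acids.
AVG_RESIDUE_MASS_DA = {
    "G": 57.05, "A": 71.08, "S": 87.08, "P": 97.12, "V": 99.13,
    "T": 101.10, "C": 103.14, "L": 113.16, "I": 113.16, "N": 114.10,
    "D": 115.09, "Q": 128.13, "K": 128.17, "E": 129.12, "M": 131.19,
    "H": 137.14, "F": 147.18, "R": 156.19, "Y": 163.18, "W": 186.21,
}
WATER_DA = 18.02  # one water for the free N- and C-termini


def polypeptide_mass_da(sequence):
    """Average mass of an unmodified linear polypeptide."""
    return sum(AVG_RESIDUE_MASS_DA[aa] for aa in sequence) + WATER_DA


def matches_reference(target_mass_da, reference_mass_da, tolerance_da):
    """True if the target mass agrees with the reference within the tolerance."""
    return abs(target_mass_da - reference_mass_da) <= tolerance_da


if __name__ == "__main__":
    reference = polypeptide_mass_da("MKVLA")  # hypothetical reference peptide
    variant = polypeptide_mass_da("MKELA")    # same peptide with V replaced by E
    print(f"mass shift: {variant - reference:+.2f} Da")
    print("matches reference:", matches_reference(variant, reference, 5.0))
```

A target whose mass differs from the reference by more than the tolerance indicates a difference in the encoded sequence, which is the kind of information on the encoding nucleic acid referred to above.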

A biological macromolecule particularly suitable for analysis by a process
of IR-MALDI mass spectrometry can be a nucleic acid, a nucleic acid analog or 
mimic, a triple helix, a polypeptide, a polypeptide analog or mimetic, a 
carbohydrate, a lipid or a proteoglycan, or can be a macromolecular complex 
such as a protein-protein complex or a nucleoprotein complex or other
complexes. For analysis by a process as disclosed herein, a target biological
macromolecule can be immobilized to a substrate, particularly a solid support,
which can be, for example, a bead, a flat surface, a chip, a capillary, a pin, a
comb, or a wafer, and can be any of various materials, including a metal, a
ceramic, a plastic, a resin, a gel, and a membrane. Immobilization can be
through a reversible linkage (i.e., an ionic bond, such as biotin/streptavidin), a
covalent bond, such as a photocleavable bond or a thiol linkage, or a hydrogen
bond, and the linkage can be cleaved using, for example, a chemical process, an
enzymatic process, or a physical process, including during the IR-MALDI mass
spectrometric analysis procedure.

A biological macromolecule to be analyzed can be conditioned prior to
IR-MALDI mass spectrometric analysis, thereby improving the ability to analyze the
particular biological macromolecule by IR-MALDI mass spectrometry, for
example, by improving the resolution of the mass spectrum. A target biological
macromolecule can be conditioned, for example, by ion exchange, by contact
with an alkylating agent or trialkylsilyl chloride, or by incorporation of at least
one mass modified subunit of the biological macromolecule. If desired, the
biological macromolecule can be isolated prior to conditioning or prior to
IR-MALDI mass spectrometric analysis.

A process for determining the identity of each target biological
macromolecule in a plurality of target biological macromolecules, which can be
fragments of a biological macromolecule, can be performed, for example, by
preparing a composition for IR-MALDI containing a plurality of differentially
mass modified target biological macromolecules and a liquid matrix, which
absorbs infrared radiation; determining the molecular mass of each differentially
mass modified target biological macromolecule in the plurality by IR-MALDI 
mass spectrometry; and comparing the molecular mass of each differentially
mass modified target biological macromolecule in the plurality with the
molecular mass of a corresponding known biological macromolecule. Where 
such a process is performed using a plurality of target biological 
macromolecules, each of which is a fragment of a larger biological 
macromolecule, the fragments can be prepared by contacting the biological
macromolecules with at least one agent that cleaves a bond involved in the 
formation of the biological macromolecules, particularly a bond between 
monomer subunits of the biological macromolecule. 

Processes for identifying one or more subunits in a biological
macromolecule using IR-MALDI mass spectrometry also are provided, for
example, processes for detecting a mutation in a nucleotide sequence. The
identity of a target nucleotide can be determined, for example, by hybridizing a
nucleic acid molecule containing the target nucleotide with a primer 
oligonucleotide that is complementary to the nucleic acid molecule at a site
adjacent to the target nucleotide, to produce a hybridized nucleic acid molecule;
contacting the hybridized nucleic acid molecule with a complete set of
dideoxynucleoside or 3'-deoxynucleoside triphosphates and a DNA dependent
DNA polymerase, so that only the dideoxynucleoside or 3'-deoxynucleoside
triphosphate that is complementary to the target nucleotide is extended onto the
primer; preparing a composition containing the extended primer and a liquid
matrix, which absorbs infrared radiation; and detecting the extended primer in 
the composition by IR-MALDI mass spectrometry, thereby determining the 
identity of the target nucleotide. 
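
The read-out of such a primer extension can be sketched as a lookup on the mass added to the primer. The values below are commonly tabulated average masses added by each dideoxynucleotide; they, the function name, the 2 Da tolerance and the example primer mass are assumptions for illustration and are not taken from the text. Because the incorporated terminator is complementary to the target nucleotide, the function reports both.

```python
# Average mass (Da) added to a primer by each incorporated dideoxynucleotide.
DDNMP_MASS_DA = {"C": 273.2, "T": 288.2, "A": 297.2, "G": 313.2}
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def identify_target_nucleotide(primer_mass_da, extended_mass_da, tolerance_da=2.0):
    """Return (incorporated_base, target_base), or None if no terminator matches.

    The incorporated base is inferred from the mass shift; the target
    nucleotide on the template is its complement.
    """
    shift_da = extended_mass_da - primer_mass_da
    for base, added_mass_da in DDNMP_MASS_DA.items():
        if abs(shift_da - added_mass_da) <= tolerance_da:
            return base, COMPLEMENT[base]
    return None


if __name__ == "__main__":
    # Hypothetical primer of 5000.0 Da extended by one terminator.
    print(identify_target_nucleotide(5000.0, 5313.2))
    print(identify_target_nucleotide(5000.0, 5100.0))
```

The smallest spacing between the four possible shifts is 9 Da (T versus A), so the mass accuracy at the mass of the extended primer has to be better than that for the call to be unambiguous.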

A process for detecting the absence or presence of a mutation in a target
nucleic acid sequence can be performed by hybridizing a nucleic acid molecule
containing the target nucleic acid sequence with at least one primer, which has
3' terminal base complementarity to the target nucleic acid sequence, to
produce a hybridized product; contacting the hybridized product with an
appropriate polymerase enzyme and sequentially with one of the four nucleoside
triphosphates, then preparing a composition containing the reaction product and
a liquid matrix, which absorbs infrared radiation; and detecting the product in 
the composition by IR-MALDI mass spectrometry, wherein the molecular weight
of the product indicates the presence or absence of a mutation next to the
3' end of the primer in the target nucleic acid molecule. A mutation in a nucleic 
acid molecule also can be detected, for example, by hybridizing the nucleic acid 
molecule with an oligonucleotide probe, to produce a hybridized nucleic acid, 
wherein a mismatch is formed at the site of a mutation; contacting the
hybridized nucleic acid with a single strand specific endonuclease, then 
preparing a composition containing the reaction product and a liquid matrix, 
which absorbs infrared radiation; and analyzing the composition by IR-MALDI 
mass spectrometry, wherein the presence of more than one nucleic acid 
fragment in the composition indicates that the nucleic acid molecule contains a
mutation. 

A process for identifying the absence or presence of a mutation in a 
target nucleic acid sequence also can be performed, for example, by performing 
at least one hybridization on a nucleic acid molecule containing the target
nucleic acid sequence with a set of ligation educts and a DNA ligase; preparing
a composition containing the reaction product and a liquid matrix, which 
absorbs infrared radiation; and analyzing the composition by IR-MALDI mass 
spectrometry. Using such a process, the detection of a ligation product in the 
composition identifies the absence of a mutation in the target nucleic acid
sequence, whereas the detection only of the set of ligation educts in the
composition identifies the presence of a mutation in the target nucleic acid
sequence. A process of detecting the presence of a ligation product, as
disclosed above, also can be useful for detecting a target nucleotide or a target 
nucleic acid by performing at least one hybridization on a nucleic acid molecule
containing the target nucleotide with a set of ligation educts and a thermostable
DNA ligase; preparing a composition containing the reaction product and a liquid 
matrix, which absorbs infrared radiation; and identifying a ligation product in the 
composition by IR-MALDI mass spectrometry, thereby detecting the presence of 
a target nucleotide in the nucleic acid sequence. 
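
The interpretation of a ligation assay described above can be sketched as follows: a peak at the expected mass of the ligation product indicates that no mutation is present, whereas peaks for the educts alone indicate a mutation. The sketch assumes that each educt mass already includes its 5'-phosphate where applicable, so that each ligation step joins two educts with the formal loss of one water; this assumption, the function names, the 5 Da tolerance and the example masses are hypothetical.

```python
WATER_DA = 18.02


def expected_ligation_product_mass(educt_masses_da):
    """Mass of the product formed by joining all educts end to end.

    Assumes each of the n - 1 new phosphodiester bonds forms with the
    formal loss of one water.
    """
    return sum(educt_masses_da) - WATER_DA * (len(educt_masses_da) - 1)


def _seen(mass_da, observed_masses_da, tolerance_da):
    return any(abs(m - mass_da) <= tolerance_da for m in observed_masses_da)


def mutation_status(observed_masses_da, educt_masses_da, tolerance_da=5.0):
    """Classify a spectrum as mutation 'absent', 'present' or 'inconclusive'."""
    product_da = expected_ligation_product_mass(educt_masses_da)
    if _seen(product_da, observed_masses_da, tolerance_da):
        return "absent"        # ligation product detected
    if all(_seen(e, observed_masses_da, tolerance_da) for e in educt_masses_da):
        return "present"       # only the unligated educts detected
    return "inconclusive"      # neither product nor the full set of educts


if __name__ == "__main__":
    educts = [3000.0, 3500.0]  # hypothetical educt masses in Da
    print(mutation_status([3000.1, 3499.8, 6482.5], educts))
    print(mutation_status([3000.1, 3499.8], educts))
```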

Processes for determining a subunit sequence of a biological
macromolecule also are provided. A subunit sequence of at least one species of
target biological macromolecule, i, can be determined, for example, by
contacting the species of target biological macromolecule with one or more
agents sufficient to cleave each bond involved in the formation of the target 
biological macromolecule, to produce a set of nested biological macromolecule 
fragments, then preparing a composition containing at least one biological 
macromolecule fragment of the set and a liquid matrix, which absorbs infrared 
radiation; and determining the molecular mass of the at least one biological 
macromolecule fragment by IR-MALDI mass spectrometry; and repeating these 
steps until the molecular mass of each biological macromolecule fragment in the 
set has been determined, thereby determining the subunit sequence of the 
species of target biological macromolecule. Such a process is particularly 
suitable for multiplex analysis of a plurality of i + 1 species of target biological 
macromolecules, wherein each species of target biological macromolecule is 
differentially mass modified such that a biological macromolecule fragment of 
each species of target biological macromolecule can be distinguished from a 
biological macromolecule of each different species by IR-MALDI mass 
spectrometry. 
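
The principle of reading a subunit sequence from a nested fragment set can be sketched as follows, assuming, purely for illustration, a polypeptide that is degraded stepwise from its carboxy terminus. Each mass difference between neighboring fragments then identifies the subunit that was removed. The small table of average residue masses, the tolerance, the function name and the example peptide are assumptions introduced for this sketch.

```python
# Average residue masses (Da) for a few amino acids; a complete table would list all twenty.
RESIDUE_MASS = {"G": 57.05, "A": 71.08, "S": 87.08, "V": 99.13, "L": 113.16, "F": 147.18}
WATER = 18.02  # mass of the terminal H and OH of an intact chain


def read_ladder(fragment_masses, tol=0.1):
    """Read subunits from the masses (Da) of a nested fragment set.

    The masses are sorted from heaviest to lightest; each step down the
    ladder corresponds to loss of one terminal subunit. Steps that match
    no residue, or more than one, are reported as "?".
    """
    ladder = sorted(fragment_masses, reverse=True)
    subunits = []
    for heavier, lighter in zip(ladder, ladder[1:]):
        delta = heavier - lighter
        matches = [aa for aa, m in RESIDUE_MASS.items() if abs(m - delta) <= tol]
        subunits.append(matches[0] if len(matches) == 1 else "?")
    return "".join(subunits)


# Hypothetical peptide GASVLF and its fragments lacking 0 to 5 C-terminal residues.
peptide = "GASVLF"
masses = [sum(RESIDUE_MASS[a] for a in peptide[:n]) + WATER
          for n in range(len(peptide), 0, -1)]
print(read_ladder(masses))  # residues in order of removal: FLVSA
```

The residues are returned in the order in which they were removed, that is, from the carboxy terminus inward; the last remaining subunit is not resolved by differences alone.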

Processes for determining the nucleotide sequence of at least one 
species of nucleic acid are provided. Such a process can be performed by 
synthesizing complementary nucleic acids, which are complementary to the 
species of nucleic acid to be sequenced, starting from an oligonucleotide primer 
and in the presence of chain terminating nucleoside triphosphates, to produce 
four sets of base-specifically terminated complementary polynucleotide 
fragments; preparing a composition for IR-MALDI, containing the four sets of 
polynucleotide fragments and a liquid matrix, which absorbs infrared radiation; 
determining the molecular weight value of each polynucleotide fragment by 
IR-MALDI mass spectrometry; and determining the nucleotide sequence of the 
species of nucleic acid by aligning the molecular weight values according to 
molecular weight. Such a process is particularly suitable to multiplex analysis 
of a plurality of i + 1 species of nucleic acids, which can be sequenced 
concurrently using i + 1 primers, wherein one of the i + 1 primers is an
unmodified primer or a mass modified primer and the other i primers are mass
modified primers, and wherein each of the i + 1 primers can be distinguished
from the other by IR-MALDI mass spectrometry. 
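
A minimal sketch of the alignment step follows. It assumes that each measured fragment mass has already been assigned to one of the four base-specific termination reactions. Sorting all fragments by mass orders them by length, and the terminating base of each successive fragment gives the next base of the synthesized (complementary) strand. The function name and the masses are hypothetical.

```python
def read_sequence(terminated_sets):
    """Read a sequence from four base-specifically terminated fragment sets.

    terminated_sets maps each terminating base ("A", "C", "G", "T") to the
    list of fragment masses (Da) observed for that termination reaction.
    Returns the synthesized strand, read from the primer outward (5' to 3').
    """
    tagged = [(mass, base)
              for base, masses in terminated_sets.items()
              for mass in masses]
    return "".join(base for _, base in sorted(tagged))


# Hypothetical masses for a primer extended by four nucleotides.
sets = {"A": [5626.4], "C": [6219.8], "G": [5313.2], "T": [5930.6]}
print(read_sequence(sets))  # GATC
```

The same ordering applies to multiplex analysis, provided the fragments arising from each mass-modified primer are first separated into their own groups by mass.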

A sequence of a target nucleic acid also can be determined by 
hybridizing at least one partially single stranded target nucleic acid to one or 
more nucleic acid probes, each probe containing a double stranded portion, a
single stranded portion, and a determinable variable sequence within the single 
stranded portion, to produce at least one hybridized target nucleic acid, then 
preparing a composition containing the hybridized target nucleic acid and a 
liquid matrix, which absorbs infrared radiation; and determining a sequence of
the hybridized target nucleic acid by IR-MALDI mass spectrometry based on the
determinable variable sequence of the probe to which the target nucleic acid 
hybridized. If desired, the steps of the process can be repeated a sufficient 
number of times to determine an entire sequence of a target nucleic acid and, 
where a plurality of target nucleic acids are to be sequenced, the one or more
nucleic acid probes can be immobilized in an array. If desired, the hybridized
target nucleic acid can be ligated to the determinable variable sequence prior to
preparing the composition for IR-MALDI. 

A process for determining the sequence of a target biological 
macromolecule also can be performed by generating at least two biological
macromolecule fragments from the target biological macromolecule, then
preparing a composition containing the biological macromolecule fragments and
a liquid matrix, which absorbs infrared radiation; and analyzing the biological 
macromolecule fragments in the composition by IR-MALDI mass spectrometry, 
thereby determining the sequence of the target nucleic acid molecule. Such a
process is particularly useful for ordering two or more portions of a biological
macromolecule sequence within a larger sequence.

Also provided are compositions for IR-MALDI that contain a liquid
matrix, which absorbs infrared radiation, and a biological macromolecule. In
particular, the biological macromolecule and the liquid matrix are present in a
ratio of about 10⁻⁴ to 10⁻⁹ biological macromolecule to liquid matrix in the
composition. Also provided are these compositions in which the biological
macromolecule is present in an amount less than about 10 picomoles of
biological macromolecule, preferably about 100 attomoles to about 1 picomole
of biological macromolecule. The compositions can further include an additive
that facilitates detection of the nucleic acid by IR-MALDI. Supports (or
substrates) on which the compositions are deposited are provided.
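
As a rough worked example of the stated ratios, the following sketch estimates the volume of matrix that corresponds to a given amount of biological macromolecule. It assumes that the ratio is a molar ratio and that the liquid matrix is glycerol (molar mass about 92.09 g/mol, density about 1.26 g/mL); both assumptions, and the function name, are introduced here for illustration.

```python
GLYCEROL_MOLAR_MASS = 92.09  # g/mol
GLYCEROL_DENSITY = 1.26      # g/mL near room temperature


def matrix_volume_nl(analyte_mol, ratio):
    """Volume of glycerol (nL) giving the requested analyte-to-matrix molar ratio."""
    matrix_mol = analyte_mol / ratio
    grams = matrix_mol * GLYCEROL_MOLAR_MASS
    return grams / GLYCEROL_DENSITY * 1e6  # mL -> nL


# 1 picomole at a ratio of 1e-4, and 100 attomoles at a ratio of 1e-9.
print(round(matrix_volume_nl(1e-12, 1e-4), 3))
print(round(matrix_volume_nl(100e-18, 1e-9), 3))
```

Under these assumptions both cases correspond to nanoliter-scale volumes of matrix.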
BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1A to 1C show mass spectra of a synthetic DNA 70-mer. Figure 
1A shows ultraviolet matrix assisted laser desorption ionization (UV-MALDI) and
detection by a linear time-of-flight (TOF) instrument using delayed extraction
and a 3-hydroxypicolinic acid (3HPA) matrix (sum of 20 single shot mass
spectra); Figure 1B shows UV-MALDI reflectron (ref) TOF spectrum, using
delayed extraction and a 3HPA matrix (sum of 25 single shot mass spectra);
Figure 1C shows IR-MALDI-refTOF spectrum, using delayed extraction and a
glycerol matrix (sum of 15 single shot mass spectra).

Figures 2A to 2D show IR-MALDI refTOF mass spectra using a 2.94 µm
wavelength and a glycerol matrix. The spectra are as follows: Figure 2A - a
synthetic DNA 21-mer (sum of 10 single shot spectra); Figure 2B - a DNA
mixture containing restriction enzyme products of a 280-mer (87 kDa), a
360-mer (112 kDa), a 920-mer (285 kDa) and a 1400-mer (433 kDa) (sum of
10 single shot spectra); Figure 2C - a DNA mixture containing restriction enzyme
products of a 130-mer (approximately 40 kDa), a 640-mer (198 kDa) and a
2180-mer (674 kDa) (sum of 20 single shot spectra); Figure 2D - an RNA 1206-mer
(approximately 387 kDa) (sum of 15 single shot spectra). Ordinate scalings are
intercomparable.

Figures 3A to 3C show the spectra of a 515-mer double stranded PCR 
DNA product. Total amounts of sample were loaded as follows:
Figure 3A - 300 fmol (single shot spectrum); Figure 3B - 3 fmol (single shot
spectrum); Figure 3C - 300 attomol (sum of 25 single shot spectra). The spectra
were obtained using an IR-MALDI refTOF, wherein the laser emitted at a
wavelength of 2.94 µm, using a glycerol matrix.
DETAILED DESCRIPTION OF THE INVENTION

Definitions 

All patents, patent applications and publications cited herein are 
incorporated herein by reference. The meanings of certain terms and phrases
used in the specification and claims are provided below. Unless defined
otherwise, all technical and scientific terms used herein have the same meaning
as is commonly understood by one of skill in the art to which the subject matter 
belongs. 

As used herein, a biological macromolecule refers to a molecule, which
typically may be found in a biological source. Biological macromolecules include
biopolymers, which are molecules containing monomeric subunits, which 
subunits can be the same or different. Macromolecules thus include molecules, 
such as peptides, proteins, small organics, oligonucleotides or monomeric units 
of the peptides, organics, nucleic acids and other macromolecules. A
monomeric unit refers to one of the constituents from which the resulting
molecule is built. Thus, monomeric units include nucleotides, amino acids, and
pharmacophores from which small organic molecules are synthesized. 

Biopolymers are well known in the art and include, for example, nucleic 
acids, polypeptides, and carbohydrates, which are naturally occurring
molecules. For purposes of the present disclosure, however, a biological
macromolecule such as a biopolymer also can be a synthetic molecule that is
based on or derived from a naturally occurring molecule or can be a 
macromolecular complex such as a nucleoprotein complex, protein-protein 
complex, or the like. When such molecule is a biopolymer, it contains at least
one molecule containing monomeric subunits in association with a second
molecule, which may or may not comprise monomeric subunits. Thus, a 
biopolymer can be, for example, a nucleic acid sequence containing a bond 
other than a phosphodiester bond between two or more nucleotides; or a 
polypeptide containing one or more mass modified amino acids; or a DNA
binding protein in association with a nucleic acid sequence containing the DNA
binding protein recognition site or a variant thereof. The monomeric subunits of 
a biopolymer can be, for example, the four nucleotides that generally 
comprise DNA, or the twenty amino acids that generally comprise a 
polypeptide, or the various sugars that comprise carbohydrates, or derivatives,
analogs or mimetics of such naturally occurring monomer subunits. Other
biological macromolecules include lipids, glycopolypeptides,
phosphopolypeptides, peptidoglycans, oligonucleotides, polysaccharides,
peptidomimetics, peptide analogs, nucleic acid analogs and other nucleic acid
structures including triple helices.

As used herein, large biological macromolecules, with reference to
proteins, refer to proteins that are larger than about the size of bovine serum
albumin (i.e., greater than about 65 kD).

As used herein, analyze means to identify or detect a target molecule in
a sample, or to determine physical or identifying structural
characteristics, such as the presence or absence of a mutation or the mass of the
nucleotide, or any method in which a property of a biological macromolecule is
assessed using IR-MALDI.

As used herein, the term "biological sample" refers to any material
obtained from a living source, for example, an animal such as a human or other
mammal, a plant, a bacterium, a fungus, a protist or a virus. The biological 
sample can be in any form, including a solid material such as a tissue, cells, a 
cell pellet; a biological fluid such as urine, blood, saliva, amniotic fluid, exudate 
from a region of infection or inflammation; a mouth wash containing buccal
cells; a cell extract, or a biopsy sample.

As used herein, the term "polymorphism" refers to the coexistence, in a 
population, of more than one form of an allele. A polymorphism can occur in a 
region of a chromosome not associated with a gene or can occur, for example, 
as an allelic variant or a portion thereof of a gene. A portion of a gene that
exists in at least two different forms, for example, two different nucleotide
sequences, is referred to as a "polymorphic region of a gene." A polymorphic
region of a gene can be localized to a single nucleotide, the identity of which

As used herein, the term "corresponding known biological
macromolecule" generally is used as a control for comparison to a second
biological macromolecule, particularly a target biological macromolecule. By
[...] of a target biological macromolecule with a
[...] STOP codon, the sequence of the target polypeptide can be substantially
different from that of the corresponding known polypeptide.

With respect to a nucleic acid, a target nucleic acid can be, for example,
a DNA molecule that is obtained from a subject, such as a prostate cancer
patient, and includes the polymorphic region that demonstrates a
nucleotide sequence associated with prostate cancer, and the corresponding
nucleic acid can be the nucleotide sequence of an allele that is present in the
majority of subjects in a relevant population.

A target biological macromolecule can be a fragment of a larger 
biological macromolecule and can be produced by contacting the larger 
biological macromolecule with an appropriate fragmenting agent.

As used herein, the term "fragmenting agent" means a physical,
chemical or biochemical agent that, upon contacting a biological macromolecule,
breaks the biological macromolecule into at least two separate portions. In
general, a fragmenting agent is specific for a particular type of biological
macromolecule, for example, a peptidase, which cleaves a polypeptide; a
nuclease, which cleaves a nucleic acid molecule; or a glycosidase, which
cleaves a carbohydrate. Non-specific fragmenting agents also are well known
and include, for example, physical agents such as ionizing radiation or sonication.
Contacting a biological macromolecule with a fragmenting agent produces
fragments of the biological macromolecule.

As used herein, the term "fragment," when used with reference to a
biological macromolecule, means a portion of the biological macromolecule that
has a lower molecular mass than the entire biological macromolecule. A
fragment of a biological macromolecule can be one or more of the subunits that
comprise the biological macromolecule, or can be portions of the biological
macromolecule lacking one or more subunits, including deletion fragments.

A fragment of a polypeptide, for example, generally is produced by
specific chemical or enzymatic degradation of the polypeptide. Where chemical
or enzymatic cleavage occurs in a sequence specific manner, the production of
fragments of a polypeptide is defined by the primary amino acid sequence of the
polypeptide. Fragments of a polypeptide can be produced, for example, by
contacting the polypeptide, which can be immobilized to a solid support, with a
chemical agent such as cyanogen bromide, which cleaves a polypeptide at
methionine residues, or hydroxylamine at high pH, which can cleave an Asp-Gly
peptide bond; or with a peptidase, for example, an endopeptidase such as
trypsin, which cleaves a polypeptide at Lys or Arg residues, or an exopeptidase
such as carboxypeptidase, which produces one or more free amino acids, which
have been removed from the carboxy terminus of the polypeptide, and deletion
fragments of the polypeptide that lack the one or more amino acids.

The term "deletion fragment" refers to a fragment of a biological
macromolecule that remains following sequential cleavage of a subunit from a
terminus of the biological macromolecule. The term "nested set of deletion
fragments" refers to a population of deletion fragments that results from
sequential cleavage of subunits from a biological macromolecule. A nested set
of deletion fragments generally contains at least one deletion fragment that
terminates in each subunit of at least a portion of the biological macromolecule,
thereby allowing sequencing of the biological macromolecule. Thus, as many as
N deletion fragments can be produced from a biological macromolecule, where
"N" is the number of subunits in the biological macromolecule, although fewer
than N deletion fragments can be produced. It should be recognized that a
"nested set" of nucleic acid fragments also can be produced, for example,
by performing a chain-terminating polymerase reaction such as a dideoxy
sequencing method.

In comparison to the production of deletion fragments using a
fragmenting agent that cleaves a biological macromolecule from a terminus,
treatment of a biological macromolecule with a fragmenting agent that
recognizes specific sites in the biological macromolecule results in the
production of M + 1 fragments of the biological macromolecule, where M is
the number of specific cleavage sites in the biological macromolecule. For
example, treatment of a polypeptide having four internal and interspersed
methionine residues with cyanogen bromide results in the production of five
fragments of the polypeptide.
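
The M + 1 relationship can be checked with a short sketch. It assumes a one-letter amino acid sequence and cleavage on the carboxy-terminal side of each methionine, which is the commonly described behavior of cyanogen bromide; the function name and the example sequence are hypothetical.

```python
def cleave_after(sequence, residue="M"):
    """Split a one-letter sequence on the C-terminal side of each `residue`.

    A residue at the very end of the chain is not an internal site and
    therefore does not create an additional fragment.
    """
    fragments, start = [], 0
    for i, aa in enumerate(sequence):
        if aa == residue and i < len(sequence) - 1:
            fragments.append(sequence[start:i + 1])
            start = i + 1
    fragments.append(sequence[start:])
    return fragments


# Hypothetical polypeptide with four internal, interspersed methionines.
fragments = cleave_after("GAMSLMVFMAGMLS")
print(len(fragments), fragments)  # 5 ['GAM', 'SLM', 'VFM', 'AGM', 'LS']
```

With M = 4 internal cleavage sites the sketch returns M + 1 = 5 fragments, in agreement with the example given above.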

Fragments of nucleic acids, carbohydrates, or other biological
macromolecules also can be produced. For example, exonucleases, including
DNAses and RNAses, and endonucleases, including restriction endonucleases,
can be used to produce fragments of a nucleic acid molecule (see Sambrook et
al., Molecular Cloning: A laboratory manual (Cold Spring Harbor Laboratory
Press 1989), listing nucleic acid fragmenting agents). The choice of a nuclease
to produce nucleic acid fragments will depend on the process being performed
and the characteristics of the nucleic acid molecule, for example, whether it is
DNA or RNA and whether, if DNA, it contains recognition sites, if necessary for
action by the nuclease. Similarly, fragments of carbohydrates can be produced
using enzymes such as exoglycosidases or endoglycosidases, for example,
amylases, which can produce fragments of carbohydrates containing
α-1,4-glycosidic bonds (see U.S. Patent No. 5,821,063).

A nested set of deletion fragments of a target biological macromolecule
can be produced using an agent that cleaves the biological macromolecule from
a terminus.

As used herein, the term "agent that cleaves a biological macromolecule
unilaterally from a terminus" refers to a physical, chemical or biological agent
for sequentially removing subunits from one end of a biological macromolecule.
A biological agent that cleaves a biological macromolecule unilaterally from a
terminus is exemplified by an exopeptidase such as carboxypeptidase Y, which
sequentially cleaves amino acids from the carboxyl terminus of a polypeptide
(see U.S. Patent No. 5,792,664; International Publ. WO 96/36732), or by an
exonuclease that sequentially cleaves nucleotides
from the 3'-hydroxyl terminus of a double stranded DNA (see International
Publ. WO 94/21822). A physical agent is exemplified by a light source, for
example, a laser, which can cleave a terminal subunit from a biological
macromolecule, particularly where the subunit is bound to the biological
macromolecule through a photolabile bond. A chemical agent is exemplified by
phenylisothiocyanate (Edman's reagent), which, in the presence of an acid,
cleaves an amino terminal amino acid from a polypeptide.

As used herein, the residues of naturally occurring α-amino acids are the
residues of those 20 α-amino acids found in nature that are incorporated into
protein by the specific recognition of the charged tRNA molecule with its
cognate mRNA codon in humans.

As used herein, non-naturally occurring amino acids refer to amino acids
that are not genetically encoded. Preferred such non-naturally occurring amino
acids herein include those with unsaturated side chains.

As used herein, the term "polypeptide" means at least two amino acids,
or amino acid derivatives, which can be mass modified amino acids or
non-naturally-occurring amino acids, that are linked by a peptide bond, which
can be a modified peptide bond. Exemplary polypeptides include, but are not limited
to, native proteins, gene products, protein conjugates, mutant or polymorphic
polypeptides, post-translationally modified proteins, genetically engineered gene
products including products of chemical synthesis, in vitro translation,
cell-based expression systems, including fast evolution systems involving vector
shuffling, random or directed mutagenesis and peptide sequence randomization,
oligopeptides, antibodies, enzymes, receptors, regulatory proteins, nucleic
acid-binding proteins, hormones, or protein products of a display method such as
phage or bacterial display methods. 

A polypeptide can be translated from a nucleotide sequence that is at 
least a portion of a coding sequence, or from a nucleotide sequence that is not
naturally translated due, for example, to its being in a reading frame other than
the coding frame or to its being an intron sequence, a 3' or 5' untranslated 
sequence, or a regulatory sequence such as a promoter. A polypeptide also can 
be chemically synthesized and can be modified by chemical or enzymatic 
methods following translation or chemical synthesis. The terms "protein,"
"polypeptide" and "peptide" can be used interchangeably herein when referring
to a translated nucleic acid, for example, a gene product, although "peptides" 
generally are smaller than "polypeptides" and "proteins" often can have 
post-translational modifications. 

As used herein, the term "nucleic acid" refers to a polynucleotide
containing at least two covalently linked nucleotide or nucleotide analog
subunits. A nucleic acid can be a deoxyribonucleic acid (DNA), a ribonucleic
acid (RNA), or an analog of DNA or RNA, such as PNA, and can contain, for 
example, one or more nucleotide analogs or a covalent linkage (backbone) other 
than a phosphodiester bond, for example, a thioester bond, a phosphotriester
bond, or a peptide bond (peptide nucleic acid; PNA; see, for example, Tam et
al., Nucl. Acids Res. 22:977-986 (1994); Ecker and Crooke, BioTechnology
13:351-360 (1995)); triple helices are also contemplated. The nucleic acid can
be single-stranded, double-stranded, or a mixture thereof. For purposes herein,
unless specified otherwise, the nucleic acid is double-stranded or it is apparent 
from the context. Nucleotide analogs are commercially available and methods 
of preparing polynucleotides containing such nucleotide analogs are well known 
(Lin et al., Nucl. Acids Res. 22:5220-5234 (1994); Jellinek et al., Biochemistry
34:11363-11372 (1995); Pagratis et al., Nature Biotechnol. 15:68-73 (1997)).

A nucleic acid can be single stranded or double stranded, including, for 
example, a DNA-RNA hybrid. A nucleic acid also can be a portion of a longer 
nucleic acid molecule, for example, a portion of a gene containing a polymorphic 
region. The molecular structure of a nucleic acid, for example, a gene or a 
portion thereof, is defined by its nucleotide content, including deletions, 
substitutions or additions of one or more nucleotides; the nucleotide sequence; 
the state of methylation; or any other modification of the nucleotide sequence. 
Although a nucleic acid contains two or more nucleotides or nucleotide analogs
linked by a covalent bond, including single stranded or double stranded
molecules, it should be recognized that a "fragment" of a nucleic acid, which
can be produced as discussed above, can be as small as a single nucleotide. 
The terms "polynucleotide" and "oligonucleotide" also are used herein to mean 
two or more nucleotides or nucleotide analogs linked by a covalent bond, 
although oligonucleotides such as PCR primers generally are less than about 
fifty to one hundred nucleotides in length. 

As used herein, the phrase "determining the identity of a target 
biological macromolecule" refers to determining at least one characteristic of the 
biological macromolecule, which can be a nucleic acid, polypeptide or other 
biological macromolecule. Determining the identity of a biological 
macromolecule can include, for example, determining the molecular mass or 
charge of the biological macromolecule; or determining the identity of at least 
one subunit, or of a subunit sequence of the biological macromolecule; or 
determining a particular pattern of fragments of the biological macromolecule. 
For example, where the biological macromolecule is a nucleic acid, determining 
the identity of the target nucleic acid can include determining at least one 
nucleotide of the target nucleic acid, or determining the number of nucleotide
repeats present in a sequence of tandem nucleotide repeats. Similarly, where
the target biological macromolecule is a polypeptide, determining the identity of 
the target polypeptide can include determining at least one amino acid, or a 
particular pattern of peptide fragments of the target polypeptide, for example, 
following treatment of the polypeptide with an endopeptidase. Determining the
identity of a target biological macromolecule is performed by subjecting the
target biological macromolecule, if necessary, to a particular reaction, as
appropriate; preparing a composition containing the target biological macromolecule
or reaction product thereof and a liquid matrix, which absorbs IR radiation; and
analyzing the target biological macromolecule or reaction product thereof by
IR-MALDI mass spectrometry.

The terms "infrared radiation" and "infrared wavelength" refer to 
electromagnetic wavelengths that are longer than those of red light in the visible 
spectrum and shorter than radar waves, generally wavelengths within the range
of about 760 nm to about 50 µm. An appropriate infrared wavelength can be
generated using a laser, as disclosed herein. 
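
The wavelength range in this definition can be expressed as a simple check. The sketch below takes the bounds of about 760 nm and about 50 µm directly from the definition; the function name is hypothetical, and the 10.6 µm and 337 nm values are illustrative laser lines (typical of carbon dioxide and nitrogen lasers, respectively) that are not taken from this disclosure.

```python
IR_MIN_UM = 0.76  # about 760 nm, lower bound given in the definition
IR_MAX_UM = 50.0  # about 50 µm, upper bound given in the definition


def is_infrared(wavelength_um):
    """True if a wavelength in micrometres lies within the stated infrared range."""
    return IR_MIN_UM <= wavelength_um <= IR_MAX_UM


for label, wavelength in [("2.94 µm", 2.94), ("10.6 µm", 10.6), ("337 nm", 0.337)]:
    print(label, is_infrared(wavelength))
```

The 2.94 µm wavelength used for the spectra described in the figures falls within the range, whereas an ultraviolet line such as 337 nm does not.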

As used herein, the term "liquid matrix" means a material that has a
sufficient absorption at the wavelength of the laser to be used in performing
desorption and ionization (i.e., an IR emitting laser) and that is a liquid at room
temperature (about 20°C, 1 atm). The contemplated liquids are those that can
form vitreous solids or glasses in the solid state as opposed to a crystalline
structure, such as that which forms when a matrix such as picolinic acid or
3HPA is dried. Vitreous solids and glasses do not form solid crystalline
heterogeneous structures, but rather retain properties of liquids that derive from
their lack of ordered structure. In addition, such liquid matrices form a
homogeneous layer when applied to the surface of a substrate or support. Thus,
for purposes herein, liquid matrices are relatively non-volatile materials that are
biocompatible, particularly compatible with nucleic acids and/or proteins, and
include, but are not limited to, alcohols, including glycols and polyols, such as
glycerol; sugars, such as sucrose, mannose, galactose, and other sugars as well
as polymeric sugars; ethylene glycol, propylene glycol, trimethylolpropane,
pentaerythritol, dextrose, methylglycoside or sorbitol; and
other such materials that in the solid state can form glasses rather than
crystalline structures. Also included is "glassy" water, which state occurs
under conditions in which very small volumes, i.e., submicroliters, particularly
nanoliters or less, are dispensed. Other liquid matrices include, but are not
limited to, triethanolamine, lactic acid, 3-nitrobenzylalcohol, diethanolamine,
DMSO, nitrophenyloctylether (3-NPOE), 2,2'-dithiodiethanol, tetraethyleneglycol,
dithiothreitol/dithioerythritol (DTT/DTE), 2,3-dihydroxy-propyl-benzyl ether,
α-tocopherol, and thioglycerol. Other suitable "liquid" matrices are set forth
below.

For absorption purposes, the liquid matrix can contain at least one 
chromophore or functional group that strongly absorbs infrared radiation. 
Examples of appropriate functional groups include nitro, sulfonyl, sulfonic acid, 
sulfonamide, nitrile or cyanide, carbonyl, aldehyde, carboxylic acid, amide, 
ester, anhydride, ketone, amine, hydroxyl, aromatic rings, dienes and other
conjugated systems. A liquid matrix, which absorbs IR radiation, including a 
composition containing a biological macromolecule to be analyzed by IR-MALDI 
and a liquid matrix, can contain an additive that facilitates IR-MALDI analysis of 
the biological macromolecule. 

As used herein, appropriate viscosity refers to the viscosity for
dispensing glass-type liquid matrices and means that the matrix can be dispensed
as a small volume and evenly distributed over a small surface area in a thin layer.

As used herein, the term "additive" means a material that facilitates IR- 
MALDI analysis of a biological macromolecule. For example, an additive can 
facilitate solubility of the biological macromolecule in a composition containing a 
liquid matrix. An additive also can be a compound or compounds that have a 
high extinction coefficient (E) at the laser wavelength used for desorption and 
ionization, for example, dinitrobenzenes or polyenes. Additives also include 
compounds that alter the ionic strength of the matrix/sample mixture or the 
matrix. Exemplary salt additives include, but are not limited to, ammonium
salts and salts of amines. Exemplary salt additives for this purpose include
NH4-acetate and Tris-HCl.

Where the biological macromolecule to be analyzed by IR-MALDI is a 
nucleic acid, for example, an additive can be a compound that acidifies the 
liquid matrix, thereby inducing dissociation of double stranded nucleic acids or
denaturing a secondary structure of a nucleic acid such as tRNA or other single 
stranded nucleic acid. An additive also can minimize salt formation between the 
matrix and the biological macromolecule and can be, for example, a material 
that conditions the biological macromolecule. When it is desirable to analyze or
detect a double-stranded nucleic acid by IR-MALDI, the additive can be a
substance that stabilizes the double-stranded molecule or reduces denaturation
of the double-stranded nucleic acid, but that is generally compatible with mass 
spectrometric analysis. Such additives include, but are not limited to, salts. 
Preferred salt additives include ammonium salts and salts of amines. Exemplary
salt additives for this purpose include NH4-acetate and Tris-HCl.

The matrix can be treated by further purification to remove other organic 
contaminants, including harmful derivatives and other by-products of the 
production process. 

A biological macromolecule or fragment thereof, particularly a target
biological macromolecule, can be conditioned prior to IR-MALDI mass
spectrometry. 

As used herein, the term "conditioned" or "conditioning," when used in 
reference to a biological macromolecule, means that the biological 
macromolecule is modified so as to decrease the amount of IR radiation required
to ionize or volatilize the biological macromolecule, to minimize the likelihood of
undesirable fragmentation of the biological macromolecule, or to increase the 
resolution of a mass spectrum of the biological macromolecule or fragments 
thereof. Resolution of a mass spectrum of a target biological macromolecule or 
fragment thereof can be increased by conditioning the biological macromolecule
prior to performing IR-MALDI mass spectrometry. Conditioning can be
performed at any stage prior to IR-MALDI mass spectrometry, particularly while
the biological macromolecule is immobilized to a substrate. Conditioning
includes any process that achieves these results, and includes, but is not limited
to, subjecting the macromolecule to ion exchange or other process that provides
for a uniform charge distribution, mass modification, modification of the
phosphodiester backbone of a nucleic acid, removal of negative charge from the
phosphodiester backbone, cation exchange, further purification, and any other
such process known to those of skill in the art to achieve conditioning. 

Conditioning of a biological macromolecule will depend, in part, on the
biochemical nature of the biological macromolecule. For example, a biological
macromolecule can be conditioned by treatment with a cation exchange material
or an anion exchange material, which reduces the charge heterogeneity of the
biological macromolecule, thereby eliminating peak broadening due to
heterogeneity in the number of cations (or anions) bound to the target biological
macromolecule. A polypeptide, for example, can be conditioned by treatment
with an alkylating agent such as alkyliodide, iodoacetamide, iodoethanol, or
2,3-epoxy-1-propanol, which prevents the formation of disulfide bonds. Such
alkylating agents also can be used to condition a nucleic acid by transforming
the monothiophosphodiester bonds to phosphotriester bonds. A polypeptide
also can be conditioned by converting charged amino acid side chains to
uncharged derivatives by contact with trialkylsilyl chlorides, which also can be
used to condition a nucleic acid by transforming phosphodiester bonds to
uncharged derivatives. Biological macromolecules also can be conditioned by
incorporating modified subunits that are more stable than the corresponding
unmodified subunits, for example, the substitution of N7- or N9-deazapurine
nucleotides in a target nucleic acid, thereby minimizing the likelihood of
fragmentation of the biological macromolecule.

The processes disclosed herein provide methods for analyzing a plurality 
of biological macromolecules in one or a few samplings, for example, by 
multiplex analysis. 

As used herein, the term "multiplex" refers to simultaneously determining
the identity of at least two target biological macromolecules by IR-MALDI mass
spectrometry. For example, where a population of different target biological
macromolecules are present in an array on a microchip or other substrate,
multiplexing can be used to determine the identity of a plurality of target
biological macromolecules. Multiplexing can be performed, for example, by
differentially mass modifying each different biological macromolecule of interest,
then using IR-MALDI mass spectrometry to determine the identity of each
different biological macromolecule. Multiplex analysis provides the advantage
that a plurality of target biological macromolecules can be identified in as few as
a single IR-MALDI mass spectrum, as compared to having to perform a separate
mass spectrometric analysis for each individual target biological macromolecule.

"Multiplexing" can be achieved by several different methodologies. For
example, several mutations can be simultaneously detected on one target
sequence by employing corresponding detector (probe) molecules (e.g.,
oligonucleotides or oligonucleotide mimetics). The molecular weight differences
between the detector oligonucleotides D1, D2 and D3 must be large enough so
that simultaneous detection (multiplexing) is possible. This can be achieved
either by the sequence itself (composition or length) or by the introduction of
mass-modifying functionalities into the detector oligonucleotide. Mass
modifying moieties can be attached, for instance, to the 5'-end of the
oligonucleotide, to the nucleobase (or bases), to the phosphate backbone, to
the 2'-position of the nucleoside (nucleosides) and/or to the terminal
3'-position. Examples of mass modifying moieties include, for example, a halogen,
an azido, or a moiety of the type XR, wherein X is a linking group and R is a
mass-modifying functionality. The mass-modifying functionality can thus be used to
introduce defined mass increments into the oligonucleotide molecule.
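The requirement that detector masses differ enough for simultaneous detection can be illustrated with a minimal Python sketch. The detector masses and the minimum separation used here are hypothetical values chosen only for illustration; the separation actually needed depends on the resolution of the instrument and is not specified in the text.

```python
from itertools import combinations


def resolvable(masses, min_delta):
    """Return True if every pair of masses differs by at least min_delta (Da)."""
    return all(
        abs(a - b) >= min_delta
        for a, b in combinations(masses.values(), 2)
    )


# Hypothetical detector oligonucleotide masses in daltons (not from the text).
detectors = {"D1": 5000.0, "D2": 5074.0, "D3": 5131.0}

# Hypothetical minimum separation; the smallest pairwise gap above is 57 Da.
print(resolvable(detectors, min_delta=50.0))
```

If the check fails, a larger mass-modifying functionality could be chosen for one of the detectors and the check repeated.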
The mass-modifying moiety, M, can be attached either to the
nucleobase (in the case of, for example, c7-deazanucleosides, also to C-7), to the
triphosphate group at the alpha phosphate, or to the 2'-position of the sugar
ring of the nucleoside triphosphate. Furthermore, the mass-modifying
functionality can be added so as to effect chain termination, such as by
attaching it to the 3'-position of the sugar ring in the nucleoside triphosphate.
As another exemplary embodiment, various mass-modifying functionalities, R,
other than oligo/polyethylene glycols, can be selected and attached via
appropriate linking chemistries, X. A simple mass-modification can be achieved
by substituting H for halogens like F, Cl, Br and/or I, or pseudohalogens such as
SCN, NCS, or by using different alkyl, aryl or aralkyl moieties such as methyl,
ethyl, propyl, isopropyl, t-butyl, hexyl, phenyl, substituted phenyl, benzyl, or
functional groups such as CH2F, CHF2, CF3, Si(CH3)3, Si(CH3)2(C2H5),
Si(CH3)(C2H5)2, Si(C2H5)3. Yet another mass-modification can be obtained by
attaching homo- or heteropeptides through the nucleic acid molecule (e.g.,
detector (D)) or nucleoside triphosphates. One example useful in generating
mass-modified species with a mass increment of 57 is the attachment of
oligoglycines, e.g., mass-modifications of 74 (r = 1, m = 0), 131 (r = 1, m = 2),
188 (r = 1, m = 3), 245 (r = 1, m = 4) are achieved. Simple oligoamides also can
be used, e.g., mass-modifications of 74 (r = 1, m = 0), 88 (r = 2, m = 0), 102
(r = 3, m = 0), 116 (r = 4, m = 0), etc. are obtainable. The mass
modifications serve not only to aid in multiplexing, but also to enhance or aid in
resolving mass spectrometry of fragments (i.e., mass modification aids in
"conditioning" the nucleic acids for analysis). Other chemistries can be used in
the mass-modified compounds, as for example, those described in
Oligonucleotides and Analogues, A Practical Approach, F. Eckstein, editor, IRL
Press, Oxford, 1991 and are known to those of skill in the art of mass
spectrometry.
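The two series of mass increments listed above follow simple arithmetic progressions, which the following Python sketch reproduces. The step index k is an illustrative counter introduced here and is not the same as the r and m indices used in the text; the constants are taken from the listed values (a base increment of 74, steps of 57 for the oligoglycine series and steps of 14 for the oligoamide series).

```python
BASE_INCREMENT = 74   # smallest mass modification listed in the text
GLYCINE_STEP = 57     # stated mass increment of the oligoglycine series
METHYLENE_STEP = 14   # difference between successive oligoamide values


def oligoglycine_increments(count):
    """First `count` mass increments of the oligoglycine series."""
    return [BASE_INCREMENT + GLYCINE_STEP * k for k in range(count)]


def oligoamide_increments(count):
    """First `count` mass increments of the simple oligoamide series."""
    return [BASE_INCREMENT + METHYLENE_STEP * k for k in range(count)]


print(oligoglycine_increments(4))  # [74, 131, 188, 245]
print(oligoamide_increments(4))    # [74, 88, 102, 116]
```

Such a table of defined increments can be used to assign a distinct mass shift to each species in a multiplex analysis.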

As used herein, the term "plurality," when used in reference to biological 
macromolecules, means two or more biological macromolecules, each of which 
has a different subunit sequence. The difference in sequences can be due to a 
naturally occurring variation among the sequences, for example, to an allelic 
variation in a nucleotide or an encoded amino acid, or can be due to the 
introduction of particular modifications into various sequences, for example, the 
differential incorporation of mass modified nucleotides or amino acids into each 
nucleic acid or polypeptide, respectively, in the plurality. 

The processes as disclosed herein can be performed using an isolated 
biological macromolecule. 

As used herein, the term "isolated" means that a biological
macromolecule is substantially separated from macromolecules normally
associated with the biological macromolecule in its natural state. An isolated
nucleic acid molecule, for example, is substantially separated from the cellular
material normally associated with it in a cell or, as relevant, can be substantially
separated from bacterial or viral material; or from culture medium where
produced by recombinant DNA techniques; or from chemical precursors or other
chemicals where the nucleic acid is chemically synthesized. In general, an
isolated nucleic acid molecule, which can be a fragment of a larger nucleic acid,
is at least about 50% enriched with respect to its natural state, and generally is
about 70% to about 80% enriched, particularly about 90% or 95% or more.
Preferably, an isolated nucleic acid constitutes at least about 50% of a sample
containing the nucleic acid, and can be at least about 70% or 80% of the
material in a sample, particularly at least about 90% to 95% or greater of the
sample.

Similarly, an isolated polypeptide can be identified based on its being
enriched with respect to materials it naturally is associated with or its
constituting a fraction of a sample containing the polypeptide to the same
degree as defined above, i.e., enriched at least about 50% with respect to its
natural state or constituting at least about 50% of a sample containing the
polypeptide. An isolated polypeptide, for example, can be purified from a cell
that normally expresses the polypeptide or can be produced using recombinant
DNA methodology, and can be a fragment of a larger polypeptide.

A biological macromolecule can be isolated using a reagent that interacts 
specifically with the biological macromolecule or with a tag attached to the 
biological macromolecule. For example, a target polypeptide can be isolated 
using a reagent that interacts specifically with the target polypeptide, with a 
peptide tag (i.e., a peptide that can serve to specifically bind to a reagent, such as
a column) fused to the target polypeptide, or with a peptide tag conjugated to 
the target polypeptide. 

As used herein, the term "reagent" means a ligand or a ligand binding
molecule that interacts specifically with a particular ligand binding molecule or
ligand, respectively. The term "tag peptide" or "peptide tag" is not to be
confused with a mass tag, and is used herein to mean a peptide for which a
reagent is available. The term "tag" refers more generally to any molecule for
which a reagent is available and, therefore, includes a tag peptide.

As used herein, a reagent can be an antibody that interacts specifically
with an epitope of a target biological macromolecule, for example, a 
polypeptide, or with an epitope of a tag attached to the target biological 
macromolecule. For example, a reagent can be an anti-myc epitope antibody, 
which can interact specifically with a myc epitope fused to a target polypeptide. 
A reagent also can be, for example, a metal ion such as nickel ion or cobalt ion, 
which interacts specifically with a polyhistidine tag peptide; or zinc, copper or, 
for example, a zinc finger domain, which interacts specifically with a 
polyarginine or polylysine tag peptide; or a molecule such as avidin, streptavidin 
or a derivative thereof, which interacts specifically with a tag such as biotin or a 
derivative thereof (see International Publ. WO 97/43617, which describes, for
example, methods for dissociating biotin compounds, including biotin and biotin 
analogs conjugated (biotinylated) to a polypeptide, from biotin binding 
compounds, including avidin and streptavidin, using amines, particularly 
ammonia). 

A tag such as biotin also can be incorporated into a target nucleic acid, 
thereby allowing isolation of the target nucleic acid using a reagent such as 
avidin or streptavidin. In addition, a target nucleic acid can be isolated by 
hybridization to a reagent containing a complementary nucleic acid sequence,
which can be immobilized to a solid support such as beads, for example, 
magnetic beads, if desired. 

The term "interacts specifically," when used in reference to a reagent
and a target biological macromolecule sequence or a tag to which the reagent
binds, indicates that binding occurs with relatively high affinity. As such, a
reagent has an affinity of at least about 1 x 10^6 M^-1, generally at least about
1 x 10^7 M^-1 and, in particular, at least about 1 x 10^8 M^-1, for the particular
biological macromolecule sequence or tag. A reagent that interacts specifically,
for example, with a particular tag peptide primarily binds the tag peptide,
regardless of whether other unrelated molecules are present and, therefore, is
useful for isolating the tag peptide, including a target polypeptide fused to the
tag peptide, from a sample containing the target polypeptide, for example, from
an in vitro translation reaction. Similarly, a reagent complementary nucleic acid
sequence that interacts specifically with a target nucleic acid selectively binds
the target nucleic acid, but not unrelated nucleic acid molecules.
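As a rough numerical illustration of the affinity criterion, the Python sketch below classifies an association constant against a set of affinity tiers and converts it to the equivalent dissociation constant. The tier values are read from the text as 10^6, 10^7 and 10^8 M^-1 (the middle value is taken as an assumption); the function names and the example constants are hypothetical.

```python
# Affinity tiers in M^-1 as read from the text (middle tier assumed to be 1e7).
AFFINITY_TIERS = (1e6, 1e7, 1e8)


def dissociation_constant(ka_per_molar):
    """Convert an association constant (M^-1) to a dissociation constant (M)."""
    if ka_per_molar <= 0:
        raise ValueError("association constant must be positive")
    return 1.0 / ka_per_molar


def highest_tier_met(ka_per_molar, tiers=AFFINITY_TIERS):
    """Return the largest tier that ka meets, or None if it meets none."""
    met = [t for t in tiers if ka_per_molar >= t]
    return max(met) if met else None


# Hypothetical reagent with Ka = 5e7 M^-1: meets the 1e7 tier but not 1e8.
print(highest_tier_met(5e7))
print(dissociation_constant(1e6))  # about 1 micromolar
```

In these terms, the minimum affinity of about 10^6 M^-1 corresponds to a dissociation constant of about 1 micromolar or lower.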
A hybridizing nucleic acid sequence, which generally is an
oligonucleotide, is at least nine nucleotides in length, such sequences being
particularly useful as primers for the polymerase chain reaction (PCR), and can
be at least fourteen nucleotides in length or, if desired, at least seventeen
nucleotides in length, such nucleotide sequences being particularly useful as
hybridization probes, as well as for PCR. It should be recognized that the
conditions required for specific hybridization of an oligonucleotide, for example,
a PCR primer, with a nucleic acid sequence, for example, a target nucleic acid,
depend, in part, on the degree of complementarity shared between the
sequences, the GC content of the hybridizing molecules, and the length of the
antisense nucleic acid sequence, and that conditions suitable for obtaining
specific hybridization can be calculated based on readily available formulas or
can be determined empirically (Sambrook et al., Molecular Cloning: A Laboratory
Manual (Cold Spring Harbor Laboratory Press 1989); Ausubel et al., Current
Protocols in Molecular Biology (Green Publ., NY 1989)).
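One widely used approximation for short oligonucleotides is the Wallace rule, which estimates the melting temperature as 2 degrees C per A or T plus 4 degrees C per G or C. The Python sketch below applies it; the text does not say which formula is intended, so the choice of the Wallace rule, the function name and the example sequence are assumptions made for illustration only.

```python
def wallace_tm(sequence):
    """Estimate melting temperature (deg C) of a short DNA oligonucleotide
    using the Wallace rule: Tm = 2*(A+T) + 4*(G+C)."""
    seq = sequence.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if at + gc != len(seq):
        raise ValueError("sequence may contain only A, C, G and T")
    return 2 * at + 4 * gc


# Hypothetical 17-mer (9 A/T and 8 G/C): 2*9 + 4*8 = 50
print(wallace_tm("ACGTACGTACGTACGTA"))
```

The rule reflects the dependence on length and GC content noted above; it does not account for mismatches or salt concentration, which is one reason conditions often are refined empirically.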

It can be advantageous in performing a disclosed process to immobilize a
biological macromolecule, for example, a target nucleic acid or a target
polypeptide, on a substrate, particularly a solid support, such as a bead,
microchip, glass or plastic capillary, or any surface, particularly a flat surface,
which can contain a structure such as wells, pins or other means by which the
target macromolecule is constrained at a site. A biological macromolecule can
be conjugated to a solid support by various means, including, for example, by a
streptavidin or avidin to biotin interaction; a hydrophobic interaction; by a
magnetic interaction using, for example, functionalized magnetic beads such as
DYNABEADS, which are streptavidin coated magnetic beads (Dynal Inc.; Great
Neck NY); by a polar interaction such as a "wetting" association between two
polar surfaces or between oligo/polyethylene glycol; by the formation of a
covalent bond such as an amide bond, a disulfide bond, a thioether bond, or the
like; through a crosslinking agent; and through an acid-labile or photocleavable
linker (see, for example, Hermanson, Bioconjugate Techniques (Academic Press
1996)). In addition, a tag can be conjugated to a biological macromolecule of
interest, particularly to a target biological macromolecule.

As used herein, the term "conjugated" or "immobilized" refers to an 
attachment, which can be a covalent attachment or a noncovalent attachment, 
that is stable under defined conditions. As disclosed herein, a biological 
macromolecule can be immobilized to a substrate, or a first substrate can be 
conjugated to a second substrate. Immobilization of a biological macromolecule
to a substrate can be direct or can be indirect through a linker, and can be
reversible or irreversible. A reversible immobilization can be reversed either by
cleaving the attachment, for example, using light to cleave a photocleavable 
bond, or by subjecting the attachment to conditions that reverse the bond, for 
example, reducing conditions, which reverse a disulfide linkage. 

As used herein, the term "substrate" or "solid support" means a flat 
surface or a surface with structures, to which a functional group, including a 
biological macromolecule containing a reactive group, can be conjugated. The 
term "surface with structures" means a substrate that contains, for example, 
wells, pins or the like, to which a functional group, including a biological 
macromolecule containing a reactive group, can be attached. Numerous 
examples of solid supports (substrates) are disclosed herein or otherwise known 
in the art. 

A process as disclosed herein can be used to identify a subject that has 
or is predisposed to a disease or condition. As used herein, the term "disease" 
has its commonly understood meaning of a pathologic state in a subject. For 
purposes of the present disclosure, a disease can be due, for example, to a 
genetic mutation, a chromosomal defect or an infectious organism. The term 
"condition," which is to be distinguished from conditioning of a biological 
macromolecule, is used herein to mean any state of a subject, including, for 
example, a pathologic state or a state that determines, in part, how the subject 
will respond to a stimulus. The condition of a subject can be determined, in
part, by determining a characteristic of the subject's genotype, which can
provide an indication as to how the subject will respond, for example, to a graft
or to treatment with a particular medicament; or by detecting a particular
biological macromolecule in a biological sample obtained from the subject, for
example, expression of a carbohydrate associated with a particular disease.
Accordingly, reference to a subject being predisposed to a condition can
indicate, for example, that the subject has a genotype indicating that the
subject will not respond favorably to a particular medicament or that the subject
will reject a particular graft.

Reference herein to an allele or an allelic variant being "associated" with
a disease or condition means that the particular genotype is characteristic, at
least in part, of the genotype exhibited by a population of subjects that have or
are predisposed to the disease or condition. For example, an allelic variant such
as a mutation in the BRCA1 gene is associated with breast cancer, and an allelic
variant such as a higher than normal number of trinucleotide repeats in a
particular gene is associated with prostate cancer. The skilled artisan will
recognize that an association of an allelic variant with a disease or condition can
be identified using well known statistical methods for sampling and analysis of a
population.

As used herein, compositions include mixtures of materials as well
as solutions.

Except as otherwise disclosed, the practice of the processes described
herein employs conventional techniques of cell biology, cell culture, molecular
biology, transgenic biology, microbiology, recombinant DNA, and immunology,
which are within the skill of the art and described, for example, in DNA Cloning,
Volumes I and II (D.N. Glover, ed., 1985); Oligonucleotide Synthesis (M.J. Gait,
ed., 1984); Mullis et al., U.S. Patent No. 4,683,194; Nucleic Acid Hybridization
(Hames and Higgins, eds., 1984); Transcription and Translation (Hames and
Higgins, eds., 1984); Culture of Animal Cells (R.I. Freshney; Alan R. Liss, Inc.,
1987); Immobilized Cells and Enzymes (IRL Press, 1986); B. Perbal, A Practical
Guide to Molecular Cloning (1984); Gene Transfer Vectors For Mammalian Cells
(Miller and Calos, eds.; Cold Spring Harbor Laboratory 1987); Methods In
Enzymology, Vols. 154 and 155 (Wu et al., eds., Academic Press, NY);
Immunochemical Methods In Cell And Molecular Biology (Mayer and Walker,
eds.; Academic Press London, 1987); Handbook Of Experimental Immunology,
Volumes I to IV (Weir and Blackwell, eds., 1986); Manipulating the Mouse
Embryo (Cold Spring Harbor Laboratory Press, Cold Spring Harbor NY, 1986).
PROCESSES AND COMPOSITIONS FOR USE WITH IR MALDI

The processes and compositions disclosed herein allow the detection, 
identification or characterization of biological macromolecules, including nucleic 
acids, polypeptides, and carbohydrates, as well as macromolecular complexes 
such as protein complexes and nucleoprotein complexes, by infrared (IR) matrix 
assisted laser desorption/ionization (MALDI) mass spectrometry. A composition
for IR-MALDI is provided that contains at least a biological macromolecule to be
analyzed by IR-MALDI mass spectrometry and a liquid matrix, which absorbs IR
radiation. Such a composition, which can
be deposited on a substrate, is useful for determining a characteristic of a 
biological macromolecule by IR-MALDI mass spectrometry. 

Processes for analyzing a target biological macromolecule using IR- 
MALDI mass spectrometry also are provided, including, for example, processes 
for detecting a target biological macromolecule in a sample, particularly a 
biological sample; processes for determining the identity of a biological 
macromolecule such as the presence of a mutation or other genetic change in a 
nucleic acid or of an amino acid change in a polypeptide encoded by a nucleic 
acid having a genetic change; and processes for determining a sequence of a 
biological macromolecule. The processes disclosed herein allow the analysis by 
IR-MALDI mass spectrometry of one or more target biological macromolecules, 
either in separate, but related processes such as a high throughput process, 
where the biological macromolecules can be analyzed serially, or can be 
arranged in an array on a silicon wafer, for example, and analyzed in parallel; or 
in a single process using a multiplex format, where each biological 
macromolecule in a plurality is differentially identifiable, for example, due to 
differential mass modification of the biological macromolecules. 

The disclosed processes and compositions are based, in part, on the
finding that high resolution mass spectra of large nucleic acid molecules (DNA
and RNA) can be obtained by desorbing and ionizing the nucleic acids in a liquid
matrix using a laser that emits in the infrared region of the electromagnetic
spectrum. Accordingly, a process is provided for performing IR-MALDI mass
spectrometry, which includes mixing a nucleic acid composition with a liquid
matrix to form a matrix/nucleic acid composition and depositing the composition
onto a substrate to form a homogeneous, thin layer of matrix/nucleic acid
composition. The nucleic acid containing substrate then can be illuminated with
IR radiation of an appropriate wavelength to be absorbed by the matrix, so that
the nucleic acid is desorbed and ionized, thereby emitting ion particles that can
be extracted (separated) and analyzed by a mass analyzer to determine the mass
of the nucleic acid. A process for analyzing a nucleic acid by mass spectrometry
can be performed by depositing a composition containing the nucleic acid and a
liquid matrix on a substrate, to form a homogeneous, thin layer of a nucleic
acid/liquid matrix composition; illuminating the substrate containing the
deposited composition with an infrared laser, so that the nucleic acid is
desorbed and ionized; and mass separating and detecting the ionized nucleic
acid using an appropriate mass separation and analysis format.

Processes are provided for analyzing a target biological macromolecule,
particularly a target nucleic acid, by preparing a composition containing the
target biological macromolecule and a liquid matrix, which absorbs IR radiation,
and analyzing the target biological macromolecule in the composition by
IR-MALDI mass spectrometry. The various processes disclosed herein allow a
determination of the molecular mass of a target biological macromolecule, the
detection or identification of a target biological macromolecule, which can be
present in a biological sample, or the determination of a subunit sequence of a
target biological macromolecule. Depending on the source of the target
biological macromolecule, a process as disclosed herein can be useful, for
example, for determining whether an individual has a disease or a predisposition
to a disease, or for determining heredity, identity or compatibility of an
individual (see International Publ. WO 98/20019).

A target biological macromolecule, for example, a target nucleic acid
molecule, can be obtained from a subject, particularly from a cell or tissue in the
subject or from a biological fluid, i.e., a biological sample. A target biological
macromolecule can be a target nucleic acid molecule, or can be a target
polypeptide, which can be obtained, for example, by in vitro translation of an
RNA molecule encoding the target polypeptide; or by in vitro transcription of a
nucleic acid encoding the target polypeptide, followed by translation, which can
be performed in vitro or in a cell, where the nucleic acid to be transcribed is
obtained from a subject. The processes disclosed herein provide fast and
reliable methods for identifying or obtaining information about the target
biological macromolecule.

Exemplary Advantages of IR-MALDI in the Detection of Target Molecules 
Obtained from Biological Samples 

Biological samples containing a target molecule which have undergone
some purification still are likely to contain extraneous contaminants (i.e.,
materials other than the target molecule) that are not present in a pure sample
of target molecule. For example, extraneous proteins and salts may be present
in partially purified preparations, thereby making such preparations in reality
"mixtures" as opposed to pure samples. Accordingly, mass resolution,
accuracy, sensitivity and the signal-to-noise ratio become very critical
parameters in mass spectrometric methods designed to detect the presence of a
target molecule obtained from a biological sample. The mass spectrometric
technique must be able to clearly resolve the target molecule, which may not be
present in significant quantities, from the contaminant materials.

Thus, the fact that a particular mass spectrometric method may be used
to measure the mass of a relatively pure biological molecule is no guarantee that
it will be applicable to the detection of target molecules obtained from a
biological sample. Furthermore, because of the inherent differences in the
various types of mass spectrometric methods (e.g., ESI and MALDI using
different lasers and/or matrices), the fact that one mass spectrometric technique
may be useful in the detection of target molecules obtained from a biological
sample is no guarantee that another type would also be suitable for this
purpose. Additionally, the fact that a particular mass spectrometric method or
set of conditions may be used to detect one particular type of target molecule
from a biological sample does not guarantee that it can be used effectively to
detect another type of target molecule from a biological sample. For example,
even different sizes and types of a single class of target molecule (e.g.,
single-stranded vs. double-stranded DNA) from a biological sample may or may not
be detected by different mass spectrometric methods and conditions, just as
completely different classes of target molecules, e.g., nucleic acids vs. proteins,
from a biological sample may or may not be detected by different mass
spectrometric methods and conditions.

A comparison of proteins and nucleic acids reveals several differences
that directly impact their amenability to analysis by mass spectrometry. For
example, nucleic acids are typically more susceptible to fragmentation than
proteins due to losses of nucleobases as a result of the labile N-glycosidic bond
between the different bases and the deoxyribose moiety and to depurination.
Spectra of nucleic acids reveal a greater tendency toward adduct formation than
those of proteins. Furthermore, the relative ease of desorption/ionization
appears to be greater for proteins as compared to nucleic acids since proteins
tend to fold into defined structures whereas nucleic acids have less tertiary
structure than proteins.

As disclosed herein, IR-MALDI mass spectrometry has been found to be
effective and advantageous in methods of detection of target molecules,
particularly large target molecules, obtained from biological samples. This has
been due in part to the recognition of the significance of defining the optimal
parameters (for example, the particular combinations of laser, wavelength,
matrix, additive, pulse width, beam profile, temperature and/or fluence) that
provide the level of resolution, sensitivity, signal-to-noise level, etc., required to
detect a target molecule obtained from a biological sample.

For example, shorter pulse widths can be used in IR-MALDI mass
spectrometric detection of target molecules, particularly employing lasers with
optoelectronic switches. Typically, pulse widths less than about 90 ns, and
generally about 80 ns, may be used in IR-MALDI mass spectrometric detection
methods.

In addition, lower electric field strength for ion extraction can be used in
IR-MALDI mass spectrometry detection of target molecules. Field strengths of
less than about 1000 V/mm to about 200 V/mm may typically be used in
IR-MALDI mass spectrometry detection of target molecules. Furthermore, the
single-shot ion signals are a factor of 3-5 times more intense than those
obtained with UV-MALDI mass spectrometry, and fewer shots may be required
to obtain an adequate signal-to-noise ratio.
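The typical operating ranges mentioned above (pulse widths under about 90 ns and extraction field strengths from about 200 V/mm to under about 1000 V/mm) can be expressed as a simple settings check. The following Python sketch is illustrative only: the class and function names are hypothetical, and the cut-offs are treated as hard limits even though the text gives them as approximate.

```python
from dataclasses import dataclass


@dataclass
class IrMaldiSettings:
    pulse_width_ns: float             # laser pulse width in nanoseconds
    extraction_field_v_per_mm: float  # ion extraction field strength


def within_typical_ranges(settings):
    """Check settings against the approximate ranges given in the text."""
    pulse_ok = settings.pulse_width_ns < 90.0
    field_ok = 200.0 <= settings.extraction_field_v_per_mm < 1000.0
    return pulse_ok and field_ok


# Hypothetical settings: 80 ns pulse, 500 V/mm extraction field.
print(within_typical_ranges(IrMaldiSettings(80.0, 500.0)))
```

A check of this kind could be run before an automated acquisition to flag settings that fall outside the ranges described here.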

With these improvements, the choice of laser fluence (energy per unit
area on the sample) can be much less critical. Whereas in order to avoid risking
substantial ion fragmentation in UV-MALDI mass spectrometry it is necessary to
restrict fluence to values between H0 and 1.5 H0, in the disclosed IR-MALDI
mass spectrometric methods for detecting target molecules, it is possible to use
fluence values of up to 3 H0 or 5 H0, particularly when glycerol is used as a
matrix.
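The difference in usable fluence range can be made concrete with a short Python sketch that computes the window from the reference fluence H0 and an upper multiple of it (1.5 for UV-MALDI; 3 or 5 for IR-MALDI with glycerol, as stated above). The function names, the example value of H0 and its units are hypothetical.

```python
UV_MALDI_UPPER_FACTOR = 1.5
IR_MALDI_GLYCEROL_UPPER_FACTORS = (3.0, 5.0)


def fluence_window(h0, upper_factor):
    """Return (lower, upper) usable fluence for a reference fluence h0."""
    if h0 <= 0 or upper_factor < 1:
        raise ValueError("h0 must be positive and upper_factor at least 1")
    return (h0, upper_factor * h0)


def in_window(fluence, window):
    """True if fluence lies within the (lower, upper) window, inclusive."""
    lower, upper = window
    return lower <= fluence <= upper


h0 = 100.0  # hypothetical reference fluence, arbitrary units
uv_window = fluence_window(h0, UV_MALDI_UPPER_FACTOR)
ir_window = fluence_window(h0, IR_MALDI_GLYCEROL_UPPER_FACTORS[0])

# A fluence of 2.5 * h0 falls outside the UV window but inside the IR window.
print(in_window(250.0, uv_window), in_window(250.0, ir_window))
```

The wider window means that a fluence setting need not be tuned as tightly for each sample, which is consistent with the statement that the choice of fluence is much less critical.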

In addition, glycerol, when used as a matrix in IR-MALDI mass 
spectrometry has been found to be particularly tolerant to contaminants such as 
salts, buffers, detergents, etc. in the sample being analyzed for the presence or 
absence of a target molecule. This has been surprisingly advantageous in the 
detection of target polypeptides, particularly large polypeptides, by IR-MALDI 
mass spectrometry using glycerol as a matrix because polypeptides obtained 
from biological samples can contain such contaminants. Such contaminants, 
for instance, salts, can interfere with UV-MALDI measurement of polypeptides 
using more traditional acidic solid state matrices. Accordingly, less purification 
of target molecules from biological samples is required in preparing a sample for 
analysis by IR-MALDI using a glycerol matrix than by UV-MALDI. 

For a glycerol matrix, when used in IR-MALDI mass spectrometric
methods, the molar ratio of analyte-to-matrix is much less critical than it is for
crystalline matrices. Analyte-to-matrix ratios in the range of about 5 x 10^-3 to
1 x 10^-6 can be employed in IR-MALDI mass spectrometric detection of target
molecules without substantial degradation of the ion signal. This is particularly
advantageous in the analysis of biological samples when the concentration of
target molecule may not be known.



WO 99/57318    PCT/US99/10251



With these improved conditions and other conditions and methods as
described herein, clear ion signals are obtainable using IR-MALDI mass
spectrometry for even large target molecules from biological samples, e.g.,
proteins greater than 500 kDa and nucleic acids greater than 700 kDa. Thus, the
detection of target molecules, particularly large target molecules, obtained from
biological samples, which are notoriously difficult to analyze due to the presence
of mixtures, contaminants and impurities, is made possible by IR-MALDI mass
spectrometry and further is made amenable to automation as desired in
large-scale diagnostic and screening procedures.
COMPOSITIONS FOR IR-MALDI ANALYSIS OF BIOLOGICAL 
MACROMOLECULES 

Compositions, which are suitable for IR-MALDI, are provided herein.
Such a composition, referred to herein as a "composition for IR-MALDI," is a
liquid mixture containing a biological macromolecule, which is to be analyzed by
IR-MALDI, and a liquid matrix, which absorbs infrared radiation. A biological
macromolecule suitable for analysis by IR-MALDI can be, for example, a nucleic
acid, a polypeptide or a carbohydrate, or can be a macromolecular complex
such as a nucleoprotein complex, protein-protein complex, a polysaccharide,
an oligosaccharide, such as dextrans and dextrins, lipids, lipopolysaccharides
and other macromolecules.

A composition for IR-MALDI contains the biological macromolecule, for
example, a nucleic acid, and the liquid matrix, generally in a ratio of about 10^-4
to 10^-9. The composition for IR-MALDI can contain less than about 10
picomoles of biological macromolecule to be analyzed, for example, about
100 attomol to about 1 picomole (pmol) of the biological macromolecule. A
composition for IR-MALDI also can contain an additive, which facilitates
detection of the biological macromolecule by IR-MALDI. For example, an
additive can improve the miscibility of the biological macromolecule in the liquid
matrix. For example, a composition can contain a nucleic acid as the biological
macromolecule to be analyzed by IR-MALDI and glycerol as the liquid matrix.
The liquid matrix can be treated with a cation exchange material prior to mixing
with the nucleic acid, if desired, to reduce alkali salt formation with the
phosphate backbone.

A composition for IR-MALDI can be deposited on a substrate, for example, a
solid support such as a silicon wafer, a bead, or other support known to those of
skill in the art, thereby providing a solid support having deposited thereon a
composition for IR-MALDI.

In particular, the solid support can be a silicon wafer and a plurality of
compositions for IR-MALDI can be deposited on the wafer in an addressable
array. If desired, a composition for IR-MALDI can contain two or more different
biological macromolecules to be analyzed, provided the biological
macromolecules are differentially identifiable due, for example, to mass
modification.

Liquid matrices 

As defined above, a liquid matrix refers to a material that is compatible
with the macromolecule of interest, absorbs IR, and can form a glass (rather
than a crystalline structure). A liquid matrix has a sufficient absorption at the
wavelength of the laser to be used in performing desorption and ionization and
is a liquid (not a solid or a gas) at room temperature (one atmosphere
pressure).



In addition, for purposes herein in performing IR-MALDI, contemplated
matrices in embodiments for methods of diagnosis and detection of proteins and
nucleic acids also can include materials that form crystalline structures. Such
materials include, but are not limited to, water, ice, succinic acid,
picolinic acid and other acids. These types of materials include those that do
form ordered structures when cooled, dried and/or placed under pressure. These
types of matrices are contemplated for use in detection methods of proteins
using IR-MALDI. When succinic acid is dispensed on a selected substrate (or
support) for IR-MALDI, preferably, nucleic acid should be added prior to
dispensing. For other matrices that are dried on the substrate, nucleic acids
can be added to the dried matrix material.

For absorption purposes, the liquid matrix can contain at least one
chromophore or functional group that strongly absorbs infrared radiation.
Examples of appropriate functional groups include nitro, sulfonyl, sulfonic acid,
sulfonamide, nitrile or cyanide, carbonyl, aldehyde, carboxylic acid, amide,
ester, anhydride, ketone, amine, hydroxyl, aromatic rings, dienes and other
conjugated systems.

Preferred liquid matrices include, but are not limited to, substituted or
unsubstituted (1) alcohols, preferably non-volatile liquids (or liquids of low
volatility), including glycols, such as glycerol, 1,2-propanediol or
1,3-propanediol, 1,2-butanediol, 1,3-butanediol, 1,4-butanediol and
triethanolamine, sucrose, mannose and other polyols; (2) carboxylic acids,
including formic acid, lactic acid, acetic acid, propionic acid, butanoic acid,
pentanoic acid and hexanoic acid, and esters thereof; (3) primary or secondary
amides, including acetamide, propanamide, butanamide, pentanamide and
hexanamide, whether branched or unbranched; (4) primary or secondary amines,
including propylamine, butylamine, pentylamine, hexylamine, heptylamine,
diethylamine and dipropylamine; (5) nitriles, hydrazine and hydrazide.

Particularly preferred compounds contain eight or fewer carbon atoms.
For example, particularly preferred carboxylic acids and amides contain six or
fewer carbon atoms, preferred amines contain about three to about seven
carbons and preferred nitriles contain eight or fewer carbons. Compounds that
are unsaturated to any degree can contain a larger number of carbons, since
unsaturation confers liquid properties on a compound. Although the particular
compound used as a liquid matrix must contain a functional group, the matrix
preferably is not so reactive that it fragments or otherwise damages the nucleic
acid to be analyzed.
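
The carbon-count preferences stated above can be summarized as a small lookup. The following Python sketch is illustrative only: the dictionary keys and the function name `is_particularly_preferred` are hypothetical labels chosen for this example, the lower bound of one carbon is an assumption where the text states only an upper limit, and the rule is meant for saturated compounds, since the text allows unsaturated compounds to contain more carbons.

```python
# Preferred carbon-count ranges (inclusive) for saturated compounds, as stated
# in the text. Keys are hypothetical labels used only in this sketch; a lower
# bound of 1 is assumed where the text gives only "N or fewer".
PREFERRED_CARBONS = {
    "carboxylic_acid": (1, 6),  # six or fewer carbons
    "amide": (1, 6),            # six or fewer carbons
    "amine": (3, 7),            # about three to about seven carbons
    "nitrile": (1, 8),          # eight or fewer carbons
}


def is_particularly_preferred(compound_class: str, n_carbons: int) -> bool:
    """Return True if the carbon count lies in the stated preferred range."""
    low, high = PREFERRED_CARBONS[compound_class]
    return low <= n_carbons <= high


if __name__ == "__main__":
    examples = [
        ("carboxylic_acid", "hexanoic acid", 6),
        ("amine", "heptylamine", 7),
        ("amide", "heptanamide", 7),
    ]
    for compound_class, name, n_carbons in examples:
        verdict = is_particularly_preferred(compound_class, n_carbons)
        print(f"{name} ({n_carbons} C): particularly preferred = {verdict}")
```

Carbon count is only one of the criteria discussed; miscibility, viscosity and vacuum stability still have to be considered separately.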

An appropriate liquid matrix should be miscible with a nucleic acid
compatible solvent. Preferably, the liquid matrix also should have an
appropriate viscosity, for example, typically less than or equal to about
1.5 Ns/m^2, preferably in the range of about 1 Ns/m^2 to about 2 Ns/m^2, which
is the viscosity of glycerol at room temperature, to facilitate dispensing of
microliter or nanoliter volumes of matrix alone or mixed with a nucleic acid
compatible solvent.

For use herein, a liquid matrix also should have an appropriate survival 
time in the vacuum of the analyzer, typically having a pressure in the range of 
about 10^-10 mbars, to allow the analysis to be completed. Liquids having an
appropriate survival time are "vacuum stable," a property that is strictly a 
function of the vapor pressure of the matrix, which, in turn, is strongly 
dependent on the sample temperature. Preferred matrices have a low vapor 
pressure at room temperature such that less than about fifty percent of the 
sample in a mass analyzer having a back pressure less than or equal to 
10^-5 mbars evaporates in the time needed for the analysis of all samples
introduced, for example, about 10 minutes to about 2 hours. For a single 
sample, for example, the analysis may be performed in minutes, whereas, for 
multiple samples, the analysis may require hours for completion. 

Glycerol, for example, can be used as a matrix at room temperature and
in a vacuum for about 10 to 15 minutes. If glycerol is to be used for analyzing
multiple samples in a single vacuum, the vacuum may need to be cooled to
maintain the sample at a temperature in the range of about -50°C to about
-100°C (about 223 K to about 173 K) for the time required to complete the
analysis. Colder temperatures can also be used, including as low as about
-200°C. Triethanolamine, in contrast, has a much lower vapor pressure than
glycerol and can survive in a vacuum for at least about one hour, even at room
temperature.
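
As a rough planning aid, the survival times and temperatures mentioned above can be put into a short Python sketch. It is a simplification under stated assumptions: the function names are hypothetical, the survival table uses the lower end of the 10 to 15 minute figure for glycerol and the one-hour figure for triethanolamine, and it ignores droplet size and instrument pressure, which also affect evaporation.

```python
# Approximate room-temperature survival times in vacuum, in minutes, taken
# from the text: glycerol about 10 to 15 min (lower bound used here) and
# triethanolamine at least about one hour.
ROOM_TEMP_SURVIVAL_MIN = {"glycerol": 10.0, "triethanolamine": 60.0}


def celsius_to_kelvin(t_celsius: float) -> float:
    """Convert a temperature from degrees Celsius to kelvin."""
    return t_celsius + 273.15


def cooling_advisable(matrix: str, analysis_minutes: float) -> bool:
    """Return True if the planned run outlasts the room-temperature survival time."""
    return analysis_minutes > ROOM_TEMP_SURVIVAL_MIN[matrix]


if __name__ == "__main__":
    # Temperatures quoted in the text for a cooled sample.
    for t_c in (-50.0, -100.0, -200.0):
        print(f"{t_c:.0f} C is about {celsius_to_kelvin(t_c):.0f} K")
    # A two-hour multi-sample run with glycerol versus triethanolamine.
    for matrix in ("glycerol", "triethanolamine"):
        print(matrix, "cooling advisable:", cooling_advisable(matrix, 120.0))
```

Under these assumptions a single short acquisition with glycerol needs no cooling, whereas a multi-sample run lasting hours does.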

Mixtures of different liquid matrices and additives to such matrices may 
be desirable to confer one or more of the properties described above. For 
example, an appropriate liquid matrix can contain a small amount of a 
composition containing an IR absorbing chromophore and a greater amount of 
an IR invisible (nonabsorbing) material, in which, for example, the nucleic acid is
soluble. It also may be useful to use a matrix that is "doped" with a small
amount of a compound or compounds having a high extinction coefficient (E) at
the laser wavelength used for desorption and ionization, for example,
dinitrobenzenes or polyenes. An additive that acidifies the liquid matrix also
may be added to dissociate double stranded nucleic acids or to denature
secondary structure of nucleic acids such as tRNA or other RNA. Additional 
additives may be helpful for minimizing salt formation between the matrix and 
the phosphate backbone of the nucleic acid. For example, the additive can 
contain an ammonium salt or ammonium loaded ion exchange bead, which 
removes alkali ions from the matrix. Alternatively, the liquid matrix can be 
distilled prior to mixture with the nucleic acid composition, to minimize salt 
formation between the matrix and the phosphate backbone of the nucleic acid. 

The liquid matrix also can be mixed with an appropriate volume of water 
or other liquid to control sample viscosity and rate of evaporation. Since all of 
the water is evaporated during mass analysis, an easily manipulated volume, for 
example, 1 can be useful for sample preparation and transfer, but still result 
in a very small volume of liquid matrix. As a result, only small volumes of 
nucleic acid sample are required to yield about 10^-16 to about 10^-12 moles (about
100 attomol to about 1 pmol) of nucleic acid in the final liquid matrix droplet. 
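
To make the amounts concrete, the following Python sketch converts a sample concentration and a dispensed volume into moles of nucleic acid and checks the result against the range of about 100 attomol to about 1 pmol. The function names and the example concentration and volume are hypothetical and are not taken from the disclosure.

```python
ATTOMOLE = 1e-18  # moles per attomole
PICOMOLE = 1e-12  # moles per picomole


def moles_dispensed(conc_micromolar: float, volume_nanoliters: float) -> float:
    """Return moles of analyte contained in a dispensed volume.

    conc_micromolar: analyte concentration in micromoles per liter.
    volume_nanoliters: dispensed volume in nanoliters.
    """
    conc_mol_per_liter = conc_micromolar * 1e-6
    volume_liters = volume_nanoliters * 1e-9
    return conc_mol_per_liter * volume_liters


def in_typical_range(moles: float,
                     low: float = 100 * ATTOMOLE,
                     high: float = 1 * PICOMOLE) -> bool:
    """Return True if the amount lies between about 100 attomol and 1 pmol."""
    return low <= moles <= high


if __name__ == "__main__":
    # Hypothetical example: 100 nL of a 1 micromolar nucleic acid solution.
    amount = moles_dispensed(1.0, 100.0)
    print(f"{amount:.1e} mol, within typical range: {in_typical_range(amount)}")
```

Because the water evaporates during analysis, the dispensed volume determines only how much analyte is delivered, not the final droplet size.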

As disclosed herein, when glycerol is used as a matrix, the final
analyte-to-glycerol molar ratio (concentration) should be in the range of about
10^-4 to 10^-9, depending on the mass of the nucleic acid, which can range from
about 10^4 Daltons to about 10^6 Daltons or greater, and the total amount of
nucleic acid available. For example, for the sensitivity test disclosed herein, the 
relatively high concentration of nucleic acid used was measured by standard UV 
spectrophotometry. Practically speaking, the appropriate amount of nucleic acid 
generated, for example, from a PCR or transcription reaction generally is known. 
The large range specified indicates that the actual amount of nucleic acid 
analyzed is not very critical. Typically, a greater amount of nucleic acid results 
in a better spectrum. There may be instances where the nucleic acid sample 
requires dilution. 
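
The analyte-to-glycerol molar ratio can be estimated from the amount of nucleic acid and the volume of glycerol remaining in the droplet. The Python sketch below is a minimal illustration, assuming a glycerol molar mass of 92.09 g/mol and a density of about 1.26 g/mL near room temperature; the function names and the example droplet volume are hypothetical.

```python
GLYCEROL_MOLAR_MASS_G_PER_MOL = 92.09  # C3H8O3
GLYCEROL_DENSITY_G_PER_ML = 1.26       # approximate value near room temperature


def glycerol_moles(volume_nanoliters: float) -> float:
    """Return moles of glycerol in a droplet of the given volume."""
    volume_ml = volume_nanoliters * 1e-6  # 1 nL = 1e-6 mL
    mass_g = volume_ml * GLYCEROL_DENSITY_G_PER_ML
    return mass_g / GLYCEROL_MOLAR_MASS_G_PER_MOL


def analyte_to_glycerol_ratio(analyte_moles: float,
                              glycerol_volume_nanoliters: float) -> float:
    """Return the analyte-to-glycerol molar ratio for one droplet."""
    return analyte_moles / glycerol_moles(glycerol_volume_nanoliters)


def ratio_in_disclosed_range(ratio: float,
                             low: float = 1e-9,
                             high: float = 1e-4) -> bool:
    """Return True if the ratio lies between about 10^-9 and 10^-4."""
    return low <= ratio <= high


if __name__ == "__main__":
    # Hypothetical example: 1 pmol of nucleic acid in a 10 nL glycerol droplet.
    ratio = analyte_to_glycerol_ratio(1e-12, 10.0)
    print(f"ratio = {ratio:.2e}, in range: {ratio_in_disclosed_range(ratio)}")
```

With these assumed values, 1 pmol in 10 nL of glycerol gives a ratio of roughly 7 x 10^-6, while 100 attomol in the same droplet falls slightly below 10^-9, which suggests that a smaller droplet would be appropriate for very small amounts.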

Other liquid matrices include, but are not limited to, triethanolamine, lactic
acid, 3-nitrobenzylalcohol, diethanolamine, DMSO, nitrophenyloctylether
(3-NPOE), 2,2'-dithiodiethanol, tetraethyleneglycol, dithiothreitol/dithioerythritol
(DTT/DTE), 2,3-dihydroxy-propyl-benzyl ether, α-tocopherol, and thioglycerol.

IMMOBILIZATION OF A BIOLOGICAL MACROMOLECULE TO A SOLID
SUPPORT OR SUBSTRATE

For IR-MALDI mass spectrometric analyses, a target biological
macromolecule or other biological macromolecule of interest can be immobilized
to a substrate, particularly a solid support, in order to facilitate manipulation of
the biological macromolecule. Solid supports are well known in the art and
include any material used as a solid support for linking nucleic acids, proteins,
carbohydrates, or the like (see, for example, International Publ. WO 98/20019).

The substrate can be selected to be impervious to the conditions of
IR-MALDI mass spectrometric analyses, and can be functionalized for the
immobilization of biological macromolecules or can be further associated with a
second solid support, if desired. Where a substrate, for example, a bead is to
be conjugated to a second solid support, biological macromolecules can be
immobilized on the functionalized bead before, during or after it is conjugated to
the second support.

A biological macromolecule can be conjugated directly to a solid support
or can be immobilized indirectly through a functional group present either on the
support, or a linker attached to the support, or the biological macromolecule or
both. For example, a polypeptide can be immobilized to a solid support through
a hydrophobic, hydrophilic or ionic interaction between the support and the
polypeptide. Although such a method can be useful for certain manipulations
such as for conditioning of the biological macromolecule prior to IR-MALDI mass
spectrometry, such a direct interaction is limited in that the orientation of the
biological macromolecule is not known and can be random based on the
position of the interacting subunits, for example, hydrophobic amino acids in a
polypeptide. Thus, a polypeptide or other biological macromolecule generally is
immobilized in a defined orientation by conjugation through a functional group
on either the solid support or the biological macromolecule or both.

A biological macromolecule can be modified by adding an appropriate
functional group to a terminus of the biological macromolecule, for example, to
the 5' or 3' end of a nucleic acid, or to the carboxyl terminus or amino terminus
of a polypeptide, or to a reactive group in the biological macromolecule, for
example, to a reactive group of a nucleotide or to the phosphodiester backbone
of a nucleic acid, or to a reactive side chain of an amino acid or to the peptide
backbone of a polypeptide. A naturally occurring nucleotide in a nucleic acid or
a naturally occurring amino acid in a polypeptide also can contain a functional
group suitable for conjugating the polypeptide to the solid support. For
example, a cysteine residue present in the polypeptide can be used to
immobilize the polypeptide to a substrate containing a sulfhydryl group, for
example, a solid support having cysteine residues attached thereto, through a
disulfide linkage. Other bonds that can be formed between two amino acids,
for example, include monosulfide bonds between two lanthionine residues,
which are non-naturally occurring amino acids that can be incorporated into a
polypeptide; a lactam bond formed by a transamidation reaction between the
side chains of an acidic amino acid and a basic amino acid, such as between the
γ-carboxyl group of Glu (or β-carboxyl group of Asp) and the ε-amino group of
Lys; or a lactone bond produced, for example, by a crosslink between the
hydroxy group of Ser and the γ-carboxyl group of Glu (or β-carboxyl group of
Asp). Thus, a solid support can be modified to contain a desired amino acid
residue, for example, a Glu residue, and a polypeptide having a Ser residue,
particularly a Ser residue at the carboxyl terminus or amino terminus, can be
conjugated to the solid support through the formation of a lactone bond. It
should be recognized, however, that the support need not be modified to
contain the particular amino acid, for example, Glu, where it is desired to form a
lactone-like bond with a Ser in the polypeptide, but can be modified, instead, to
contain an accessible carboxyl group, thus providing a function corresponding
to the γ-carboxyl group of Glu.

A biological macromolecule can be modified to facilitate immobilization to
a solid support, for example, by incorporating a chemical or physical moiety at
an appropriate position in the biological macromolecule, generally at a terminus
of the biological macromolecule. The artisan will recognize, however, that such
a modification, for example, the incorporation of a biotin moiety, can affect the
ability of a particular reagent to interact specifically with the biological
macromolecule and, accordingly, will consider this factor, if relevant, in
selecting how best to modify a biological macromolecule of interest.

In one aspect of the processes provided herein, a polypeptide of interest
can be covalently conjugated to a solid support and the immobilized polypeptide
can be used to capture a target polypeptide, which binds to the immobilized
polypeptide. The target polypeptide then can be released from the immobilized
polypeptide by ionization or volatilization for IR-MALDI mass spectrometry,
whereas the covalently conjugated polypeptide remains bound to the support.
Accordingly, a process as disclosed herein can utilize IR-MALDI to
determine the identity of polypeptides that interact specifically with a
polypeptide of interest. For example, the identity of target polypeptides
obtained from one or more biological samples that interact specifically with an
immobilized polypeptide of interest can be determined, or the identity of binding
proteins such as antibodies that bind to the immobilized polypeptide antigen of
interest, or receptors that bind to an immobilized polypeptide ligand of interest,
or the like can be determined. Such a process can be useful, for example, for
screening a combinatorial library of modified target polypeptides such as
modified antibodies, antigens, receptors, hormones, or other polypeptides to
determine the identity of those target polypeptides that interact specifically with
the immobilized polypeptide.

A solid support can be selected based on advantages that it can provide. 
For example, a solid support can provide a relatively large surface area, thereby 
allowing immobilization of a relatively large number of biological 
macromolecules. A solid support such as a bead can have any three 
dimensional structure, including a surface to which a biological macromolecule, 
functional group, or other molecule can be attached. 

A substrate also can be modified to facilitate immobilization of a
biological macromolecule. A thiol-reactive functionality is particularly useful for
immobilizing a polypeptide to a solid support (International Publ.
WO 98/20166). A thiol-reactive functionality can rapidly react with a
nucleophilic thiol moiety to produce a covalent bond, for example, a disulfide
bond or a thioether bond. A variety of thiol-reactive functionalities are known in
the art, including, for example, haloacetyls such as *****; diazoketones;
epoxy ketones; α- and β-unsaturated carbonyls such as α-enones and β-enones;
and other reactive Michael acceptors such as maleimide; acid halides; benzyl
halides; and the like. A free thiol group of a disulfide, for example, can react
with a second free thiol group by disulfide bond formation, including by disulfide
exchange. Reaction of a thiol group or other functional group can be prevented
temporarily by blocking with an appropriate protecting group (see Greene and
Wuts, Protective Groups in Organic Synthesis, 2nd ed. (John Wiley & Sons
1991)).

A thiol-reactive functionality such as 3-mercaptopropyltriethoxysilane can
be used to functionalize a silicon surface with thiol groups. The amino
functionalized silicon surface then can be reacted with a heterobifunctional
reagent such as N-succinimidyl (4-iodoacetyl) aminobenzoate (SIAB; Pierce;
Rockford IL). If desired, the thiol groups can be blocked with a photocleavable
protecting group, which then can be selectively cleaved, for example, by
photolithography, to provide portions of a surface activated for immobilization of
a polypeptide of interest. Photocleavable protecting groups are known in the art
(see, for example, International Publ. WO 92/10092; McCray et al., Ann. Rev.
Biophys. Biophys. Chem. 18:239-270 (1989)) and can be selectively deblocked
by irradiation of selected areas of the surface using, for example, a
photolithography mask.

Solid Supports (substrates) 

The solid support is any known to those of skill in the art as a matrix for
performing synthetic reactions and assays. It can be fabricated from silicon,
glass, silicon-coated materials, metal, a composite, a polymeric material such as
a plastic, a polymer-grafted material, such as a metal-grafted polymer, or other
material as disclosed herein. This material can be further functionalized, as
necessary, for example, chemically, to enhance or permit linkage of molecules
or other particles, such as cells or cell membranes or viral envelopes or other
such biological material of interest. The surface of a support can be modified,
such as by radiation grafting of a suitable polymer on the surface and
derivatization thereof to render it suitable for binding or capturing a molecule or
particle, such as a cell. The support may also include beads linked thereto (see
copending allowed U.S. application Serial No. 08/746,036, copending U.S.
application Serial No. 08/933,792, and International application No.
PCT/US97/20194, which claims priority to the U.S. applications). It may also
include dendrite trees of captured material, or combinations of such additional
components. A solid support can have one or more target sites, each of which
can contain or retain a volume of a liquid.

By way of example, a solid support can be a flat surface such as a glass 
fiber filter, a glass surface, a silicon or silicon dioxide surface, a composite 
surface, or a metal surface, including a steel, gold, silver, aluminum or copper 
surface, a plastic material, including polyethylene, polypropylene, polyamide or 
polyvinylidenedifluoride, which further can be in the form of a multiwell plate or a
membrane; can be in the form of a bead (or other geometry) or particle, such as
a silica gel, a controlled pore glass, a magnetic or cellulose bead, which can be
in a pit of a flat surface such as a wafer, for example, a silicon wafer; or can be
a pin, including an array of pins suitable for combinatorial synthesis or analysis
(see, e.g., International PCT application No. WO 98/20019), a comb or a microchip.
The skilled artisan will recognize that various factors, including the size and 
shape of the support and the chemical and physical stability of the support to 
the conditions to which it will be exposed, will be considered in selecting a 
particular solid support for use in a disclosed system or method. 

Also contemplated is the use of the end of a fiber optic cable or plate as 
a substrate or support (see, e.g., U.S. Patent No. 5,826,214, which describes 
embodiments in which the electromagnetic radiation is delivered via a fiber optic 
cable, which can abut against a thin transparent plate on which the specimen or 
resides). 

A solid support contains one or more target sites, which can contain a 
volume of a liquid. A target site can be, for example, a well, pit, channel, or 
other depression, with or without rims, on the surface of a solid support; can be 
a pin, bead or other material, which can be positioned on a surface of a solid 
support; or can be a physical barrier such as a cylinder, cone or other such 
barrier positioned on a surface of a solid support.

A target site also can be, for example, a reservoir or reaction chamber,
which is attached to a solid support (see, for example, Walters et al., Anal.
Chem. 70:5172-5176 (1998)). In addition, a target site can be etched, for
example, on a surface of a silicon wafer using a photolithographic method (see,
for example, Woolley et al., Anal. Chem. 68:4081-4086 (1996)).
Photolithography allows the construction of very small target sites, including
wells or towers, and, for example, has been used in combination with wet
chemical-etching to construct "picoliter vials" on microchips (Clark et al.,
CHEMTECH 28:20-25 (1998)).

A support also can be a glass or silicon surface containing wells having a
very thin base that is transparent to electromagnetic radiation of a desired
wavelength, such as laser light, thereby permitting measurement of parameters,
such as volume, or an excitation wavelength for fluorescence measurement.

A target site also can be defined by physico-chemical parameters such as
hydrophilicity, hydrophobicity, the presence of acidic or basic groups, groups
capable of forming a salt bridge, or any surface chemistry that allows a liquid to
grow primarily in the z direction. For example, where the liquid to be applied to
a target site is water or an aqueous composition, the target site can be defined
by a hydrophilic area surrounded by a hydrophobic area on the surface of a solid
support, or by a series of rows, alternately having less hydrophobic rows and
more hydrophobic rows, whereby the aqueous mixture is constrained to the less
hydrophobic rows. With respect to such a target site, the liquid
is dispensed, for example, onto the hydrophilic area, and is constrained from
spreading from the target site due to the adjacent and surrounding hydrophobic
area. Conversely, where the liquid is a nonpolar liquid, it is dispensed onto a
hydrophobic region and is constrained in that region due to an adjacent
hydrophilic region or a region that is less hydrophobic than the region to
which the liquid is applied.

A solid support can have a single target site, or can contain a number of
target sites, for example, 2 sites, 10 sites, 16 sites, 100 sites, 144 sites, 384
sites, 1000 sites, or more, all or some of which can be the same or can be
different. Where a solid support contains more than one target site and,
therefore, can contain, for example, more than one reaction mixture, the
characteristics that define each target site serve not only to constrain a reaction
mixture, but also to prevent intermingling of different reaction mixtures or other
liquids on the support. In addition, where a solid support contains more than
one target site, the target sites can be arranged in any pattern, for example, in a
line, a spiral, concentric circles, rows, or an array of rows and columns.
Furthermore, the location of each target site of a number of target sites on a
support can be defined. The availability of such addressable target sites on a
solid support allows multiple reactions to be performed in parallel and is
convenient, for example, for performing multiplex reactions, for including control
reactions with test reactions such that all are performed under identical
conditions, for performing a similar reaction under different conditions, or for
performing different reactions.
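
One simple way to make target sites addressable is to map each site index to a row and column position. The Python sketch below is illustrative only: the function names are hypothetical, and the 16 x 24 layout used for a 384-site support is an assumption borrowed from common microplate formats rather than a layout stated in the disclosure.

```python
from typing import Tuple


def site_to_row_col(index: int, n_cols: int) -> Tuple[int, int]:
    """Map a zero-based site index to a zero-based (row, column) pair."""
    if index < 0 or n_cols <= 0:
        raise ValueError("index must be >= 0 and n_cols must be > 0")
    return divmod(index, n_cols)


def row_col_to_site(row: int, col: int, n_cols: int) -> int:
    """Map a zero-based (row, column) pair back to a zero-based site index."""
    return row * n_cols + col


def site_label(index: int, n_cols: int) -> str:
    """Return a plate-style label such as 'A1' (row letter, 1-based column)."""
    row, col = site_to_row_col(index, n_cols)
    if row >= 26:
        raise ValueError("letter labels cover at most 26 rows")
    return f"{chr(ord('A') + row)}{col + 1}"


if __name__ == "__main__":
    n_cols = 24  # assumed 16 rows x 24 columns = 384 sites
    for index in (0, 23, 24, 383):
        print(index, site_to_row_col(index, n_cols), site_label(index, n_cols))
```

A fixed mapping of this kind lets control and test reactions be assigned to known positions so that spectra can be traced back to the sample deposited at each site.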

Thus, any substrate on which the nucleic acid/liquid matrix can be
deposited and retained for desorption and ionization of the nucleic acid can be
used in a process provided herein. Preferred substrates include, but are not
limited to, beads, for example, silica gel, controlled pore glass, magnetic beads,
cross-linked dextrans, such as those sold under the tradename Sephadex
(Pharmacia), agarose gel, such as gels sold under the tradename Sepharose
(Pharmacia), which is a hydrogen bonded polysaccharide-type agarose gel
(epichlorhydrins), or cellulose; capillaries; flat supports, for example, filters,
plates or membranes made of glass, metal surfaces such as steel, gold, silver,
aluminum, copper or silicon, or plastic such as polyethylene, polypropylene,
polyamide or polyvinylidene fluoride; pins, for example, arrays of pins suitable
for combinatorial synthesis or analysis; or beads in pits of flat surfaces such as
wafers, with or without filter plates.

Preferably the selected substrate and format are amenable to 
miniaturization, such as the chips that retain the deposited material by virtue of
hydrophobic or hydrophilic interaction, described above, in which the target site
can be defined by a hydrophilic area surrounded by a hydrophobic area on the 
surface of a solid support (or the converse).

Preferably, nucleic acid samples are prepared and deposited as a thin
layer of up to about 100 µm, more preferably 1 µm to 10 µm, onto a
substrate (see published International PCT application WO 98/20166).

Immobilization and activation 
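Numerous methods have been developed for the immobilization of
proteins, nucleic acids and other biomolecules onto solid or liquid supports
[see, e.g., Mosbach (1976) Methods in Enzymology 44; Weetall (1975)
Immobilized Enzymes, Antigens, Antibodies, and Peptides; and Kennedy et al.
(1983) Solid Phase Biochemistry, Analytical and Synthetic Aspects, Scouten, ed.,
pp. 253-391; see, generally, Affinity Techniques. Enzyme Purification: Part B.
Methods in Enzymology, Vol. 34, ed. W. B. Jakoby, M. Wilchek, Acad. Press,
N.Y. (1974); Immobilized Biochemicals and Affinity Chromatography, Advances
in Experimental Medicine and Biology, vol. 42, ed. R. Dunlap, Plenum Press,
N.Y. (1974)].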

Among the most commonly used methods are absorption and adsorption
or covalent binding to the support, either directly or via a linker, such as the
numerous disulfide linkages, thioether bonds, hindered disulfide bonds, and
covalent bonds between free reactive groups, such as amine and thiol groups,
known to those of skill in the art [see, e.g., the PIERCE CATALOG,
ImmunoTechnology Catalog & Handbook, 1992-1993, which describes the
preparation of and use of such reagents and provides a commercial source for
such reagents; and Wong (1993) Chemistry of Protein Conjugation and Cross
Linking, CRC Press; see, also DeWitt et al. (1993) Proc. Natl. Acad. Sci. U.S.A.
90:6909; Zuckermann et al. (1992) J. Am. Chem. Soc. 114:10646; Kurth et al.
(1994) J. Am. Chem. Soc. 116:2661; Ellman et al. (1994) Proc. Natl. Acad.
Sci. U.S.A. 91:4708; Sucholeiki (1994) Tetrahedron Lttrs. 35:7307; and
Su-Sun Wang (1976) J. Org. Chem. 41:3258; Padwa et al. (1971) J. Org. Chem.
41:3550 and Vedejs et al. (1984) J. Org. Chem. 49:575, which describe
photosensitive linkers].

To effect immobilization, a composition of the protein or other 
biomolecule is contacted with the support material such as any described 
herein, alumina, carbon, an ion-exchange resin, cellulose, glass or a ceramic. 
Fluorocarbon polymers have been used as supports to which biomolecules have 
been attached by adsorption [see, U.S. Pat. No. 3,843,443; Published 
International PCT Application WO 86/03840].

A large variety of methods are known for attaching biological molecules,
including proteins and nucleic acids, to solid supports [see, e.g., U.S.
Patent No. 5,451,683]. Such linkages may be effected through covalent bonds,
ionic bonds and other interactions. The linkages may be reversible or labile to 
certain conditions, such as particular EM frequencies. 

For example, U.S. Pat. No. 4,681,870 describes a method for
introducing free amino or carboxyl groups onto a silica support. These groups
may subsequently be covalently linked to other groups, such as a protein or
other anti-ligand, in the presence of a carbodiimide. Alternatively, a silica
support may be activated by treatment with a cyanogen halide under alkaline
conditions. The anti-ligand is covalently attached to the surface upon addition
to the activated surface. Another method involves modification of a polymer
surface through the successive application of multiple layers of biotin, avidin
and extenders [see, e.g., U.S. Patent No. 4,282,287]; other methods involve
photoactivation in which a polypeptide chain is attached to a solid substrate by
incorporating a light-sensitive unnatural amino acid group into the polypeptide
chain and exposing the product to low-energy ultraviolet light [see, e.g., U.S.
Patent No. 4,762,881].

Oligonucleotides have also been attached using photochemically active
reagents, such as a psoralen compound, and a coupling agent, which attaches
the photoreagent to the substrate [see, e.g., U.S. Patent No. 4,542,102 and
U.S. Patent No. 4,562,167]. Photoactivation of the photoreagent binds a ...
glass, synthetic polymers ... may be used ... directly ... [see, e.g., ... and
Smith et al. (1992) ...]. The use of perfluorocarbon polymer-based supports for
... chromatography is described in U.S. Pat. .... In this method the
biomolecule is first modified by reaction with ... perfluorooctylpropylisocyanate
described in U.S. .... The modified protein is adsorbed onto the fluorocarbon
support to ....

The ... and use of supports are well known and may be effected ...
[see, e.g., Hermanson et al. (1992) Immob...; ... 4, 73 (1992); ...
64:919; Loetscher et al. (1992) J. Chromatography 595:113-199; ...
5,443,816; Hale (1995) Analytical Biochem. 231:46-49].

Other suitable methods for linking molecules and biological particles to
solid supports are well known to those of skill in this art [see, e.g., U.S. Patent 
No. 5,416,193]. Linkers that are suitable for chemically linking molecules, such 
as proteins and nucleic acids, to supports include, but are not limited to, 
disulfide bonds, thioether bonds, hindered disulfide bonds, and covalent bonds 
between free reactive groups, such as amine and thiol groups. These bonds can 
be produced using heterobifunctional reagents to produce reactive thiol groups 
on one or both of the moieties and then reacting the thiol groups on one moiety 
with reactive thiol groups or amine groups to which reactive maleimido groups 
or thiol groups can be attached on the other. Other linkers include acid 
cleavable linkers, such as bis-maleimidoethoxy propane, acid labile-transferrin 
conjugates and adipic acid dihydrazide, that would be cleaved in more acidic 
intracellular compartments; cross linkers that are cleaved upon exposure to UV 
or visible light; and linkers such as the various domains, for example, CH1, 
CH2 and CH3, from the constant region of human IgG1 (see, Batra et al. 
(1993) Molecular Immunol. 30:379-386). Presently preferred linkages are direct 
linkages effected by adsorbing the molecule or biological particle to the surface 
of the support. Other preferred linkages are photocleavable linkages that can be 
activated by exposure to light [see, e.g., Goldmacher et al. (1992) Bioconj. 
Chem. 3:104-107, which linkers are herein incorporated by reference]. The 
photocleavable linker is selected such that the cleaving wavelength does not 
damage linked moieties. Photocleavable linkers are linkers that are cleaved 
upon exposure to light [see, e.g., Hazum et al. (1981) in Pept., Proc. Eur. Pept. 
Symp., 16th, Brunfeldt, K. (Ed.), pp. 105-110, which describes the use of a 
nitrobenzyl group as a photocleavable protective group for cysteine; Yen et al. 
(1989) Makromol. Chem. 190:69-82, which describes water soluble 
photocleavable copolymers, including hydroxypropylmethacrylamide copolymer, 
glycine copolymer, fluorescein copolymer and methylrhodamine copolymer; 
Goldmacher et al. (1992) Bioconj. Chem. 3:104-107, which describes a 
cross-linker and reagent that undergoes photolytic degradation upon exposure 
to near UV light (350 nm); and Senter et al. (1985) Photochem. Photobiol. [...], 
which describes nitrobenzyloxycarbonyl chloride cross linking reagents that 
produce photocleavable linkages]. The selected linker will depend upon the 
particular application and, if needed, may be empirically selected. 
Linkers 

A biological macromolecule can be immobilized directly to a substrate or 
can be immobilized through a linking moiety or moieties. Immobilization can be 
effected by any desired linkage, including covalent linkages, ionic linkages, 
physical linkages, and any other linkages known. The linkage can be reversible 
and/or cleavable. Any linker known to those of skill in the art to be suitable for 
immobilizing a nucleic acid, polypeptide, carbohydrate or other biological 
macromolecule to a substrate, either directly or through a spacer, can be used 
(see International Publ. WO 98/20019). Among preferred linkers are those that 
are cleaved or otherwise released upon exposure to IR. 

A biological macromolecule can be immobilized directly to a support 
through a linker or can be immobilized through a variable spacer. In addition, 
the conjugation can be directly cleavable, for example, through a photocleavable 
linkage such as a streptavidin or avidin to biotin interaction, which can be 
cleaved by a laser, or indirectly through a photocleavable linker (U.S. Patent 
No. 5,643,722) or an acid labile linker, heat sensitive linker, enzymatically 
cleavable linker or other such linker. Accordingly, a linker can provide a 
reversible linkage such that it is cleaved under defined conditions such as during 
the IR-MALDI mass spectrometry procedure. Such a linker can be, for example, 
a photocleavable bond such as a charge transfer complex or a labile bond 
formed between relatively stable organic radicals. 

A linker (L) on a biological macromolecule can form a linkage, which 
generally is a temporary linkage, with a second functional group (L') on the solid 
support. Furthermore, where the biological macromolecule has a net negative 
charge or is conditioned to have such a charge, the linkage can be formed with 
L' being, for example, a quaternary ammonium group. In this case, the surface 
of the solid support carries a negative charge that repels the negatively charged 
biological macromolecule, thereby facilitating desorption of the biological 
macromolecule for IR-MALDI mass spectrometry analysis. Desorption can 
occur due to the heat created by the IR radiation or, where L' is a chromophore, 
by specific absorption of IR radiation by the chromophore. 

A linkage (L-L') can be, for example, a disulfide bond, which is 
chemically cleavable by mercaptoethanol or dithioerythritol; a biotin/streptavidin 
linkage, which can be photocleavable; a heterobifunctional derivative of a trityl 
ether group, which can be cleaved by exposure to acidic conditions (see Koster 
et al., Tetrahedron Lett. 31:7095 (1990)); a levulinyl-mediated linkage, which 
can be cleaved under almost neutral conditions with a hydrazinium/acetate 
buffer; an arginine-arginine or a lysine-lysine bond, either of which can be 
cleaved by an endopeptidase such as trypsin; a pyrophosphate bond, which can 
be cleaved by a pyrophosphatase; or a ribonucleotide bond, which can be 
cleaved using a ribonuclease or by exposure to alkaline conditions. 

The functionalities, L and L', can also form a charge transfer complex, 
thereby forming a temporary L-L' linkage. The IR laser energy can be tuned to 
the corresponding energy of the charge-transfer wavelength and specific 
desorption from the solid support can be initiated. It will be recognized that 
several combinations of L and L' can serve this purpose and that the donor 
functionality can be on the solid support or can be coupled to the biological 
macromolecule to be detected or vice versa, provided a liquid matrix, which 
absorbs IR radiation, also is present. 

Selectively cleavable linkers that are particularly useful in a process as 
disclosed herein include photocleavable linkers, acid cleavable linkers, acid-labile 
linkers, and heat sensitive linkers. Acid cleavable linkers include, for example, 
bis-maleimidoethoxy propane, adipic acid dihydrazide linkers (Fattom et al., 
Infect. Immun. 60:584-589 (1992)), and acid labile transferrin conjugates that 
contain a sufficient portion of transferrin to permit entry into the intracellular 
transferrin cycling pathway (Welhoner et al., J. Biol. Chem. 266:4309-4314 
(1991)). Photocleavable linkers also include the linkers described in WO 
98/20019. 

Linkers suitable for chemically linking polypeptides, for example, to 
supports, include disulfide bonds, thioether bonds, hindered disulfide bonds, and 
covalent bonds between free reactive groups such as amine and thiol groups. 

Agents useful for creating linkages include, for example, dimaleimide, 
dithio-bis-nitrobenzoic acid (DTNB), N-succinimidyl-S-acetyl-thioacetate (SATA), 
N-succinimidyl-3-(2-pyridyldithio) propionate (SPDP), succinimidyl 
4-(N-maleimidomethyl)cyclohexane-1-carboxylate (SMCC) and 
6-hydrazinonicotinamide (HYNIC). Appropriate linkers, which can be crosslinking 
agents, for use in conjugating a polypeptide to a solid support include a variety 
of agents that can react with a functional group present on a surface of the 
support, or with the polypeptide, or both. Useful crosslinking agents include 
agents containing homobifunctional or heterobifunctional groups. Useful 
bifunctional crosslinking agents include, but are not limited to, 
N-succinimidyl (4-iodoacetyl) aminobenzoate (SIAB), dimaleimide, 
dithio-bis-nitrobenzoic acid (DTNB), N-succinimidyl-S-acetyl-thioacetate (SATA), 
N-succinimidyl-3-(2-pyridyldithio) propionate (SPDP), succinimidyl 
4-(N-maleimidomethyl)cyclohexane-1-carboxylate (SMCC) and 
6-hydrazinonicotinamide (HYNIC). 
A crosslinking agent also can be used to form a selectively cleavable 
bond between a biological macromolecule and a solid support. For example, a 
photolabile crosslinker such as 3-amino-(2-nitrophenyl)propionic acid (Brown et 
al., Molec. Divers. 4-12 (1995); Rothschild et al., Nucl. Acids Res. 24:351-66 
(1996); U.S. Patent No. 5,643,722) can be employed as a means for cleaving a 
polypeptide from a solid support. Other crosslinking reagents are well known in 
the art (see, for example, Wong, Chemistry of Protein Conjugation and 
Cross-Linking (CRC Press 1991); Hermanson, Bioconjugate Techniques (Academic 
Press 1996)). 

Hydroxyester linkers, including, for example, hydroxyacetate (glycolate), 
α-, β-, γ-, ..., ω-hydroxyalkanoates, ω-hydroxy(polyethylene glycol)COOH, 
hydroxybenzoates, hydroxyarylalkanoates and hydroxyalkylbenzoates, can be 
useful for immobilizing a biological macromolecule. Photocleavable linkers also 
are useful for immobilizing a biological macromolecule; methods of preparing 
such linkers are provided in International Publ. WO 98/20019. In addition, a 
bifunctional trityl linker can be attached to a solid support, for example, to the 
4-nitrophenyl active ester on a resin such as a Wang resin, through an amino 
group or a carboxyl group on the resin via an amino resin. Using a bifunctional 
trityl approach, the solid support can require treatment with a volatile acid such 
as formic acid or trifluoroacetic acid to ensure that the biological macromolecule 
can be removed. In such a case, the biological macromolecule can be deposited 
as a beadless patch at the bottom of a well of a solid support or on the flat 
surface of a solid support. After addition of a matrix composition, the biological 
macromolecule can be desorbed during IR-MALDI mass spectrometry. 

Hydrophobic trityl linkers also can be exploited as acid-labile linkers by 
using a volatile acid or an appropriate matrix composition, which is acidic or 
contains an additive that renders the liquid matrix acidic, to cleave an amino 
linked trityl group from the biological macromolecule. Acid lability also can be 
changed. For example, trityl, monomethoxytrityl, dimethoxytrityl or 
trimethoxytrityl can be changed to the appropriate p-substituted, or more acid- 
labile tritylamine derivatives. 

Other linkers include, for example, Rink amide linkers (Rink, Tetrahedron 
Letters 28:3787 (1987)), tritylchloride linkers (Leznoff, Acc. Chem. Res. 11:327 
(1978)), Merrifield linkers (Bodansky et al., Peptide Synthesis, 2d ed., Academic 
Press, New York, 1986); trityl linkers (U.S. Patent Nos. 5,410,068 and 
5,612,474); and amino trityl linkers (U.S. Patent No. 5,198,531). 

Other linkers include acid cleavable linkers such as bis-maleimidoethoxy 
propane, acid labile transferrin conjugates and adipic acid dihydrazide linkers 
that can be cleaved in more acidic intracellular compartments; photocleavable 
cross linkers that are cleaved by IR, visible or UV light; RNA linkers that are 
cleavable by ribozymes or other RNA enzymes; and linkers such as the various 
domains, including CH1, CH2 and CH3, from the constant region of human IgG1 
(see, Batra et al., Mol. Immunol. 30:379-386 (1993)). Combinations of any 
linkers also can be useful, for example, a linker that can be cleavable under 
IR-MALDI mass spectrometry conditions such as a silyl linkage or photocleavable 
linkage can be combined with a linker such as an avidin biotin linkage, which is 
not cleaved under IR-MALDI mass spectrometry conditions but can be cleaved 
under other conditions. 

A biological macromolecule of interest can be immobilized to a solid 
support such as a bead. In addition, a first solid support such as a bead also 
can be conjugated to a second solid support, which can be a second bead or 
other substrate, by any suitable means. In particular, any of the conjugation 
methods and means disclosed herein with reference to conjugation of a 
biological macromolecule to a solid support also can be applied for conjugation 
of a first support to a second support, where the first and second solid supports 
can be the same or different. Furthermore, use of bifunctional linkers allows for 
orthogonal cleavage of a biological macromolecule from a support, or of a first 
support from a second. 

It should be recognized that any of the binding members disclosed herein 
or otherwise known in the art can be reversed with respect to the examples 
provided herein. Thus, biotin, for example, can be incorporated into either a 
biological macromolecule or a solid support and, conversely, avidin or other 
biotin binding moiety would be incorporated into the support or the polypeptide, 
respectively. Other specific binding pairs contemplated for use herein are 
exemplified by hormones and their receptors, enzymes and their substrates, a 
nucleotide sequence and its complementary sequence, an antibody and the 
antigen with which it interacts specifically, and other such pairs known to those 
skilled in the art. 

A target biological macromolecule, particularly each target biological 
macromolecule in a plurality of target biological macromolecules, can be 
immobilized to a solid support prior to mass modifying, conditioning, or 
otherwise manipulating the biological macromolecule. In particular, the solid 
support can be a flat surface, or a surface with a structure such as wells, such 
that each of the target biological macromolecules in the plurality can be 
positioned in an array, each at a particular address. In general, a target 
biological macromolecule is immobilized to the solid support through a cleavable 
linker such as an acid labile linker, a chemically cleavable linker or a 
photocleavable linker. Following a reaction of the target biological 
macromolecule in a disclosed process, undesirable reaction products can be 
washed from the reaction and the remaining immobilized target biological 
macromolecule can be released, for example, by chemical cleavage or 
photocleavage, as appropriate, and can be analyzed by IR-MALDI mass 
spectrometry. It should be recognized, however, that manipulation of a 
biological macromolecule, for example, by mass modification prior to performing 
a chemical or enzymatic degradation or other reaction can influence the rate or 
extent of the reaction. Accordingly, the skilled artisan will know that the 
influence of conditioning, mass modification, or the like on the extent of a 
reaction should be characterized prior to initiating a process. 

In some cases, it can be useful to immobilize a particular target biological 
macromolecule to a support through both termini of the biological 
macromolecule, for example, the amino terminus and the carboxyl terminus of a 
polypeptide using, for example, a chemically cleavable linker at one terminus 
and a photocleavable linker at the other end. In this way, the target biological 
macromolecule, which can be immobilized, for example, in an array in wells, can 
be contacted, for example, with one or more agents that cleave at least one 
bond linking the monomer subunits in the biological macromolecule; the internal 
biological macromolecule fragments then can be washed from the wells, along 
with the agent and any reagents in the well, leaving one biological 
macromolecule fragment of the target biological macromolecule immobilized to 
the solid support through the chemically cleavable linker and a second biological 
macromolecule fragment, from the opposite end of the target biological 
macromolecule, immobilized through the photocleavable linker. Each fragment 
then can be further manipulated using a process as disclosed herein or can be 
analyzed by IR-MALDI mass spectrometry following sequential cleavage of the 
fragments, for example, after first cleaving the chemically cleavable linker, then 
cleaving the photocleavable linker. Such a process provides a convenient 
means of analyzing both termini of a biological macromolecule, thereby 
facilitating analysis of the target biological macromolecule. 

Immobilization of a target biological macromolecule at both termini can 
be performed by modifying both ends of the biological macromolecule, for 
example, one terminus being modified to allow formation of a chemically 
cleavable linkage with the solid support and the other terminus being modified 
to allow formation of a photocleavable linkage with the solid support. 
Alternatively, the biological macromolecules can be split into two portions, one 
portion being modified at one terminus to allow formation, for example, of a 
chemically cleavable linkage, and the second portion being modified at the other 
terminus to allow formation, for example, of a photocleavable linkage. The two 
populations of modified biological macromolecules then can be immobilized, 
together, on a solid support containing the appropriate functional groups for 
completing immobilization. 

IR-MALDI MASS SPECTROMETRY ANALYSIS OF BIOLOGICAL 
MACROMOLECULES 

The processes disclosed herein are useful for analyzing a biological 
macromolecule by subjecting a composition containing the biological 
macromolecule and a liquid matrix, which absorbs IR radiation, to IR-MALDI 
mass spectrometry. Depending on the process selected, the presence of a 
biological macromolecule can be detected, for example, in a biological sample; 
or a particular biological macromolecule can be identified, for example, by 
comparison to a corresponding known biological macromolecule, or by 
determining its molecular mass or at least a part of its subunit sequence (see, 
for example, U.S. Patent Nos. 5,503,980; 5,547,835; 5,605,798; and 
5,691,194; see, also, International Publs. WO 94/16101; WO 94/21822; WO 
96/29431; WO 97/37041; WO 97/42348; and WO 98/20019). 
Mass spectrometric analysis using an IR laser 

The support containing a sample can be placed in a vacuum chamber of 
a mass analyzer to identify or detect the nucleic acid in the sample. Preferably, 
the mass analyzer can maintain the temperature of a sample at a preselected 
value, for example, a temperature in the range of at least about -200°C to 
about 80°C, preferably at least about -60°C to about 40°C, more preferably 
-200°C to about 20°C, and most preferably about -60°C to about 20°C, during 
sample preparation, disposition and/or analysis. For example, improved spectra 
may be obtained, in some instances, by cooling the sample to a temperature 
below room temperature during sample preparation or mass analysis. Further, 
as described above, the vacuum stability of a matrix may be increased by 
cooling. Alternatively, it may be useful to heat a sample to denature double 
stranded nucleic acids into single strands or to decrease the viscosity during 
sample preparation. 

Desorption and ionization of the sample are performed in the mass 
analyzer using infrared radiation. Preferred infrared wavelengths are in the 
mid-IR wavelength region, from about 2.5 µm to about 12 µm. 
Preferred sources of infrared radiation are CO lasers, which emit at about 6 µm; 
CO2 lasers, which emit at about 9.2 µm to 11 µm; Er lasers, with any of a 
variety of crystals, for example, Er-YAG (yttrium-aluminum-garnet), Er-YLF or 
Er-YSGG, emitting at wavelengths of about 3 µm; and optical parametric 
oscillator lasers emitting in the range of about 2.5 µm to about 12 µm. 
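
The wavelengths quoted above can be related to the wavenumber scale used in infrared absorption spectra and to the energy carried by a single photon. The following is a minimal sketch only: the function names are hypothetical, the constants are the standard SI values, and the 2.94 µm entry is the Er:YAG wavelength given later in this section.

```python
# Convert mid-IR laser wavelengths to wavenumber and photon energy.
# Function and variable names are illustrative assumptions, not part of
# the disclosed processes.

PLANCK_J_S = 6.62607015e-34       # Planck constant, J*s (exact SI value)
LIGHT_SPEED_M_S = 2.99792458e8    # speed of light, m/s (exact SI value)
JOULE_PER_EV = 1.602176634e-19    # joules per electronvolt (exact SI value)


def wavenumber_per_cm(wavelength_um: float) -> float:
    """Wavenumber in cm^-1 for a wavelength given in micrometres."""
    return 1.0e4 / wavelength_um


def photon_energy_ev(wavelength_um: float) -> float:
    """Photon energy in eV for a wavelength given in micrometres."""
    wavelength_m = wavelength_um * 1.0e-6
    return PLANCK_J_S * LIGHT_SPEED_M_S / wavelength_m / JOULE_PER_EV


# Wavelengths taken from the text; the CO2 entries are the two ends of the
# quoted emission range.
EXAMPLE_WAVELENGTHS_UM = {
    "Er:YAG": 2.94,
    "CO": 6.0,
    "CO2 (short end)": 9.2,
    "CO2 (long end)": 11.0,
}

for name, wavelength in EXAMPLE_WAVELENGTHS_UM.items():
    print(f"{name:16s} {wavelength:5.2f} um  "
          f"{wavenumber_per_cm(wavelength):7.1f} cm^-1  "
          f"{photon_energy_ev(wavelength):.3f} eV")
```

Under these assumptions, 2.94 µm corresponds to roughly 3400 cm^-1, which lies in the O-H stretching region where matrices such as glycerol and residual water absorb.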
Pulse duration, field strength and other parameters 

Solid state Erbium lasers with pulse widths around 100 ns can be used 
for infrared Matrix-Assisted Laser Desorption/Ionization mass spectrometry 
(IR-MALDI MS) [Overberg et al., Rapid Commun. Mass Spectrom., 1990, 4, 
293-296; Berkenkamp et al., Rapid Commun. Mass Spectrom., 1997, 11, 
1399-1406]. Optical parametric oscillators (OPO) with pulse durations of a few 
nanoseconds may also be used in IR-MALDI MS. The fixed pulse width of the 
OPO systems of a few nanoseconds is determined by the pump laser. The 
pulse duration and/or size of the irradiated area (spot size) can be varied to 
generate multiply charged ions. A preferred pulse duration is in the range of 
about 100 picoseconds (psec) to about 500 nanoseconds (ns). 

An Er:YAG and an OPO laser were used to investigate pulse width and 
wavelength dependence of IR-MALDI-MS in the 5-200 ns pulse width and 3 µm 
wavelength region. For laser pulse durations from 90 to 185 ns an Er:YAG 
laser (Spektrum GmbH, Berlin, Germany, wavelength λ = 2.94 µm) was used. 
The pulse duration was varied by changing the Q-switch delay time. For the 
Nd:YAG pumped OPO laser (Mirage 3000B, Continuum, Santa Clara, CA, USA) 
the pulse width was fixed at 6 ns, whereas this system is tunable [...] 
to 4.0 µm. The wavelength scale was calibrated to an accuracy of ± 5 
nanometers. An in-house-built TOF instrument with a linear (2.2 m) and 
reflectron port (3.5 m equivalent flight length) was used. The mass 
spectrometer can be operated with static or delayed ion extraction. Special 
optics were implemented to permit a rapid interchange of the two laser beams. 
A 150 µm pinhole was illuminated by the central part of the Gaussian beam 
and imaged onto the sample to ensure a homogeneous and equal sample 
illumination for both lasers. All spectra were obtained under identical 
instrumental conditions and from identical samples. 

Results: a) To a first approximation, the threshold fluences for the 
generation of cytochrome C mass spectra were independent of the pulse 
duration in the range of 6 to 185 ns. For the OPO system the threshold 
fluences were consistently [...] significantly lower, by up to a factor of 1.5, 
as compared to the Er:YAG laser, i.e., the irradiances of ~50 MW/cm² 
(τ = 6 ns) for the OPO system and of ~2 MW/cm² (τ = 185 ns) for the 
Er:YAG laser differ by a factor of ~26. It is, therefore, concluded that the 
desorption in IR-MALDI is governed by the deposited energy per unit volume, 
rather than the peak power or irradiance, for pulse durations up to 200 ns. 

b) Within the experimental error, mass resolution for signals 
of peptides, desorbed out of a succinic acid matrix, was observed to be 
independent of the pulse width within the range of 6 - 100 ns for static and 
delayed ion extraction. For longer pulses up to 200 ns and static ion extraction 
the resolution decreased by up to a factor of two. In the analysis of the 
influence of laser pulse widths on the peak resolution of Gramicidin S, an 
optimal resolution of m/Δm = 11000 was observed for 6 ns OPO laser pulses 
with delayed ion extraction, as well as for 100 ns Erbium laser pulses in the 
linear mode of the mass spectrometer. 

c) For the 6 ns pulses an increase in the abundance of 
multiply charged ions and a decrease of signals of oligomers was observed, as 
compared to 100 ns pulses. 

d) The threshold fluence for the generation of IR-MALDI 
spectra was determined in the wavelength range from 2.6 µm to 3.6 µm for 
several solid and liquid matrices with the OPO laser system. The threshold 
fluences were compared to the corresponding transmission spectra of the 
matrices [Merke, R., Langenbucher, F., Infrared Spectra, Heyden & Co., 
Freiburg, 1964]. A clear correlation between the threshold fluences for 
succinic acid and glycerol and their (inverse) transmission was observed in a 
study of the influence of laser wavelength λ on the threshold fluence H0 of 
cytochrome C. For glycerol the double peak structure is clearly reproduced. A 
similar behavior was observed for triethanolamine. For succinic acid the 
threshold fluence follows the absorption spectrum in the range of 3.2 - 3.6 µm. 
The surprisingly low threshold fluence between 2.8 and 3.2 µm seems to 
reflect the strong absorption of residual water in the succinic acid 
microcrystals. 
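
The comparison in result a) can be checked with a short calculation, since fluence is irradiance multiplied by pulse width. This is a sketch under stated assumptions: the irradiances are the approximate values quoted in result a), the Er:YAG pulse width is taken as 185 ns (the upper end of the 90 to 185 ns range given for that laser), and the function name is hypothetical.

```python
# Relate the quoted threshold irradiances to threshold fluences.
# fluence [J/cm^2] = irradiance [W/cm^2] * pulse width [s]

def fluence_j_per_cm2(irradiance_mw_per_cm2: float, pulse_width_ns: float) -> float:
    """Fluence in J/cm^2 from irradiance in MW/cm^2 and pulse width in ns."""
    return (irradiance_mw_per_cm2 * 1.0e6) * (pulse_width_ns * 1.0e-9)


# Approximate values as quoted in result a); 185 ns is an assumption based
# on the stated Er:YAG pulse duration range.
OPO_IRRADIANCE, OPO_PULSE_NS = 50.0, 6.0
ER_YAG_IRRADIANCE, ER_YAG_PULSE_NS = 2.0, 185.0

opo_fluence = fluence_j_per_cm2(OPO_IRRADIANCE, OPO_PULSE_NS)
er_yag_fluence = fluence_j_per_cm2(ER_YAG_IRRADIANCE, ER_YAG_PULSE_NS)

irradiance_ratio = OPO_IRRADIANCE / ER_YAG_IRRADIANCE
fluence_ratio = er_yag_fluence / opo_fluence

print(f"OPO fluence:      {opo_fluence:.2f} J/cm^2")
print(f"Er:YAG fluence:   {er_yag_fluence:.2f} J/cm^2")
print(f"irradiance ratio: {irradiance_ratio:.1f}")
print(f"fluence ratio:    {fluence_ratio:.2f}")
```

With these rounded inputs the irradiances differ by a factor of about 25 while the fluences differ by well under a factor of 1.5, which is the pattern the text attributes to desorption being governed by deposited energy rather than peak power.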

Field strengths typically less than 1000 V/mm, preferably as low as 200 
V/mm, particularly for proteins, are used. 

A preferred spot size is in the range of about 50 µm in diameter to about 
1 mm. IR-MALDI can be matched with an appropriate mass analyzer, including 
linear (lin) or reflector (ref), with linear and nonlinear fields, for example, curved 
field reflectron, time-of-flight (TOF), single or multiple quadrupole, single or 
multiple magnetic sector, Fourier transform ion cyclotron resonance or ion trap. 
Preferably, detection is performed using a linTOF or a refTOF mode instrument 
in positive or negative ion modes, so that the ions are accelerated through a 
total potential difference of about 3 kV to about 30 kV in the split extraction 
source using static or delayed ion extraction (DE). TOF mass spectrometers 
separate ions according to their mass-to-charge ratio by measuring the time it 
takes generated ions to travel to a detector. The technology behind TOF mass 
spectrometers is described, for example, in U.S. Patent Nos. 5,627,369; 
5,625,184; 5,498,545; 5,160,840 and 5,045,694. Delayed extraction with 
delay times ranging from about 50 nsec to about 5 µsec may improve the mass 
resolution of some nucleic acids, for example, nucleic acids in the mass range of 
from about 30 kDa to about 50 kDa, using either a liquid or solid matrix. For 
delayed extraction, conditions are selected to permit a longer optimum 
extraction delay and hence a longer residence time, which results in increased 
resolution (see, Juhasz et al., Anal. Chem. 68:941-946 (1996); Vestal et 
al., Rapid Commun. Mass Spectrom. 9:1044-1050 (1995); see, also, U.S. 
Patent Nos. 5,777,325; 5,742,049; 5,654,545; 5,641,959; 5,654,545; and 
5,760,393, for descriptions of MALDI and delayed extraction protocols). In 
delayed ion extraction, a time delay is introduced between the formation of the 
ions and the application of the accelerating field. During the time lag, the ions 
move to new positions according to their initial velocities. By properly choosing 
the delay time and the electric fields in the acceleration region, the time of flight 
of the ions can be adjusted so as to render the flight time independent of the 
initial velocity to the first order. 
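
The dependence of flight time on mass-to-charge ratio described above can be illustrated with an idealized calculation. The sketch below is a simplification, not the instrument model: it assumes every ion gains its full kinetic energy z·e·U before entering a field-free drift tube and it ignores the acceleration region, initial velocities and extraction delay. The 2.2 m drift length matches the linear port mentioned earlier; the 20 kV potential (within the 3 kV to 30 kV range quoted above) and the approximate 12,360 Da mass for cytochrome C are illustrative assumptions, as are all names.

```python
import math

ELEMENTARY_CHARGE_C = 1.602176634e-19   # coulombs (exact SI value)
DALTON_KG = 1.66053906660e-27           # kilograms per dalton (CODATA 2018)


def drift_time_s(mass_da: float, charge: int,
                 accel_voltage_v: float, drift_length_m: float) -> float:
    """Ideal field-free drift time for an ion of given mass and charge state.

    Uses z*e*U = (1/2)*m*v^2, so t = L / v = L * sqrt(m / (2*z*e*U)).
    """
    mass_kg = mass_da * DALTON_KG
    kinetic_energy_j = charge * ELEMENTARY_CHARGE_C * accel_voltage_v
    velocity_m_s = math.sqrt(2.0 * kinetic_energy_j / mass_kg)
    return drift_length_m / velocity_m_s


# Illustrative values (assumptions, see lead-in).
CYTOCHROME_C_DA = 12360.0
ACCEL_VOLTAGE_V = 20000.0
DRIFT_LENGTH_M = 2.2

singly = drift_time_s(CYTOCHROME_C_DA, 1, ACCEL_VOLTAGE_V, DRIFT_LENGTH_M)
doubly = drift_time_s(CYTOCHROME_C_DA, 2, ACCEL_VOLTAGE_V, DRIFT_LENGTH_M)

print(f"[M+H]+   drift time: {singly * 1e6:.1f} us")
print(f"[M+2H]2+ drift time: {doubly * 1e6:.1f} us")
```

In this idealized picture the flight time scales with the square root of m/z, so a doubly charged ion arrives earlier than the singly charged ion of the same mass by a factor of the square root of two.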
ANALYSIS OF NUCLEIC ACIDS BY IR-MALDI 
Methods and processes for sequencing, diagnosis and detection of 
nucleic acids using UV-MALDI have been developed and are known to those of 
skill in the art (see, e.g., U.S. Patent Nos. 5,605,798, 5,830,655, 5,700,642, 
allowed U.S. application Serial No. 08/617,256, published International PCT 
application Nos. WO 96/29431, WO 98/20019, WO 99/14375, WO 97/03499, 
WO 98/26095 and others). 

Processes of using IR-MALDI to analyze a nucleic acid in a liquid matrix 
are provided. Nucleic acids to be analyzed according to a process provided 
herein can include any single stranded or double stranded polynucleotide such 
as DNA, including genomic DNA and cDNA; RNA; or an analog of RNA or DNA, 
as well as nucleotides or nucleosides and any derivative thereof. Nucleic acids 
can be of any size ranging from single nucleotides or nucleosides to tens of 
thousands of base pairs. For analysis herein, preferred nucleic acids contain 
about one thousand nucleotides or less. 

Nucleic acids may be obtained from a biological sample, which can be 
any material obtained from any living source, including a human, animal, plant, 
bacterium, fungus, protist or virus, using any of a number of procedures that 
are well known in the art. A particular isolation procedure for obtaining a 
nucleic acid from a biological sample can be selected as appropriate for the 
particular biological sample. For example, freeze-thaw or alkaline lysis 
procedures can be useful for obtaining nucleic acid molecules from solid 
materials; heat and alkaline lysis procedures can be useful for obtaining nucleic 
acids from blood (Rolff et al., PCR: Clinical Diagnostic and Research (Springer 
Verlag 1994)). 

Prior to being mixed with a liquid matrix, the particular nucleic acid to be 
analyzed may be further processed to yield a relatively pure, isolated nucleic 
acid sample. For example, a standard ethanol precipitation may be performed 
on restriction enzyme digested DNA. Alternatively, PCR products may require 
primer removal prior to analysis. Likewise, RNA strands can be separated from 
the molar excess of premature termination products always present in in vitro 
transcription reactions. 
SEQUENCING 

Exemplary formats and strategies 

Any sequencing strategy known to those of skill in the art, including 
Sanger, exonuclease and hybridization methods, can be adapted for use with the 
IR-MALDI methods provided herein by using liquid matrices and IR-MALDI. For 
example, a Sanger sequencing strategy assembles the sequence information by 
analysis of the nested fragments obtained by base-specific chain termination via 
their different molecular masses, which can be determined using IR-MALDI. 
Further increases in throughput, if needed, can be obtained by conditioning the 
nucleic acid fragments, such as by introducing mass modifications into the 
oligonucleotide primer, the chain-terminating nucleoside triphosphates and/or 
the chain-elongating nucleoside triphosphates, as well as by using integrated 
tag sequences that allow multiplexing by hybridization of tag specific probes 
with mass differentiated molecular weights. 
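
The way a nested set of fragments encodes sequence through mass differences can be sketched as follows. This is an illustration under simplifying assumptions, not the disclosed protocol: it treats a single idealized ladder in which each fragment is exactly one nucleotide longer than the previous one, uses approximate average residue masses for unmodified DNA nucleotides, and uses a hypothetical 5000.0 Da primer mass. All names are invented for the example.

```python
# Read a base sequence from the mass differences in a nested fragment ladder.

# Approximate average masses (Da) added per nucleotide residue in DNA.
RESIDUE_MASS_DA = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}


def read_ladder(fragment_masses, tolerance_da=2.0):
    """Return the base order implied by successive mass differences.

    Fragments are sorted by mass; each difference between neighbours is
    matched to the closest residue mass. A difference that is not within
    tolerance_da of any residue mass raises ValueError.
    """
    ordered = sorted(fragment_masses)
    bases = []
    for lighter, heavier in zip(ordered, ordered[1:]):
        delta = heavier - lighter
        base, residue = min(RESIDUE_MASS_DA.items(),
                            key=lambda item: abs(item[1] - delta))
        if abs(residue - delta) > tolerance_da:
            raise ValueError(f"unassignable mass difference: {delta:.2f} Da")
        bases.append(base)
    return "".join(bases)


# Build a hypothetical ladder: a 5000.0 Da primer extended by G, A, T, C.
PRIMER_MASS_DA = 5000.0
example_ladder = [PRIMER_MASS_DA]
for base in "GATC":
    example_ladder.append(example_ladder[-1] + RESIDUE_MASS_DA[base])

print(read_ladder(example_ladder))
```

The smallest difference between two residue masses (A versus T, about 9 Da) sets the mass accuracy needed to distinguish bases, which is one reason mass-modified nucleotides can be helpful.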

Exonuclease-based sequencing protocols can also be performed. These 
methods, which include those described in U.S. Patent No. 5,622,824 adapted 
for use with IR-MALDI, involve a direct sequencing approach and can begin with 
DNA fragments cloned into conventional cloning vectors. The DNA is, by means 
of protection, specificity of enzymatic activity, or immobilization, unilaterally 
degraded in a stepwise manner via exonuclease digestion, and the nucleotides or 
derivatives are detected by mass spectrometry. Prior to the enzymatic degradation, 
sets of ordered deletions that span the whole sequence of the cloned DNA 
fragment are created. In this manner, mass-modified nucleotides can be 
incorporated using a combination of exonuclease and DNA/RNA polymerase. 
This permits either multiplex mass spectrometry detection, or modulation of the 
activity of the exonuclease so as to synchronize the degradative process. 
Methods for sequencing by hybridization include methods of positional 
sequencing by hybridization (see, e.g., U.S. Patent Nos. 5,503,980, 5,795,714 
and 5,631,134). Briefly, sequencing by hybridization refers to methods of 
sequencing a nucleic acid by hybridizing that nucleic acid with a set of nucleic 
acid probes containing random, but determinable, sequences within the single 
stranded portion adjacent to a double stranded portion, where the single 
stranded portion of the set preferably comprises every possible combination of 
sequences over a predetermined range. Hybridization occurs by complementary 
recognition of the single stranded portion of a target with the single 
stranded portion of the probe and is thermodynamically favored by the 
presence of adjacent double strandedness of the probe. In particular, a method 
for determining a nucleotide sequence of a nucleic acid target 
by hybridization includes the steps of creating a set of nucleic acid probes, 
wherein each probe is preferably about 14-50 nucleotides in length and has a 
double stranded portion, a single stranded portion, and a variable sequence 
within the single stranded portion that 
is determinable; hybridizing the target that is at least partly single stranded to 
one or more of the nucleic acid probes; and determining the nucleotide 
sequence of the target that is hybridized to the single stranded portion of any 
probe. To detect the probes, the target can be labeled with a first detectable
label at a terminal site and a second, different detectable label at an internal
site. The labels are selected to be detectable by IR mass spectrometry.
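
As a rough illustration of a probe set whose single stranded portions cover
"every possible combination of sequences over a predetermined range", the
following minimal sketch enumerates all overhangs of a chosen length and picks
the one complementary to a target end. The function names, the overhang length
of 5 and the example target are assumptions made for illustration only; they do
not come from the cited patents.

```python
from itertools import product

BASES = "ACGT"
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def all_probe_overhangs(length):
    """Return every possible single stranded sequence of the given length."""
    return ["".join(p) for p in product(BASES, repeat=length)]


def matching_probes(target_end, overhangs):
    """Return the overhangs that are the reverse complement of the target end."""
    wanted = "".join(COMPLEMENT[b] for b in reversed(target_end))
    return [o for o in overhangs if o == wanted]


overhangs = all_probe_overhangs(5)       # hypothetical overhang length
print(len(overhangs))                    # 4**5 = 1024 probes in the set
print(matching_probes("GATTC", overhangs))
```

The set grows as 4 to the power of the overhang length, which is one reason an
array format is convenient for holding the probes.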

Examples of the above formats 

In one exemplary direct sequencing embodiment, the method of sequencing
includes (i) obtaining multiple nucleic acid copies of the target nucleic acid,
where the multiple copies contain at least one mass-modified nucleotide
corresponding to one of the four possible nucleotide bases; (ii) cleaving the
multiple nucleic acid copies from a first end to a second end with an
exonuclease having an activity that is inhibited by the mass-modified
nucleotide, thereby generating base terminated nucleic acid fragments;
(iii) identifying the nested nucleic acid fragments by IR-MALDI; and
(iv) determining the sequence of the target nucleic acid from the identified
nested nucleic acid fragments.

In all formats, the nucleic acids can be immobilized, including in array 
formats. Immobilization can be effected with linkers that are cleavable, such as 
by the IR radiation emitted by the IR laser. The linkages can be reversible or 
irreversible. 

Thus, processes for determining a subunit sequence of a target biological 
macromolecule also are provided. A sequence of a target biological 
macromolecule can be determined by contacting the biological macromolecule 
with an agent that cleaves the biological macromolecule unilaterally from a 
terminus of the biological macromolecule, to produce a nested set of deletion 
fragments; preparing a composition containing the nested set of biological 
macromolecule fragments and a liquid matrix, which absorbs infrared radiation; 
determining the molecular weight value of each biological macromolecule 
fragment in the composition by IR-MALDI mass spectrometry; and determining
the sequence of the nucleic acid from the molecular weight values of the
biological macromolecule fragments in the set.
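
The last step, reading a sequence from the molecular weights of a nested set,
can be sketched as follows for DNA. This is only an illustration under stated
assumptions: the residue masses are approximate average masses of
unmodified DNA nucleotide residues, the tolerance is arbitrary, and the
function name and example masses are hypothetical.

```python
# Approximate average masses (Da) of unmodified DNA nucleotide residues
# within a chain (assumed values for illustration).
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}


def read_ladder(fragment_masses, tolerance=2.0):
    """Infer the removed residues from the masses of a nested fragment set.

    Fragments are ordered from heaviest to lightest; each mass difference
    between neighbours is assigned to the closest residue mass. The result
    lists the bases in the order in which they were removed.
    """
    masses = sorted(fragment_masses, reverse=True)
    bases = []
    for heavier, lighter in zip(masses, masses[1:]):
        diff = heavier - lighter
        base, ref = min(RESIDUE_MASS.items(), key=lambda kv: abs(kv[1] - diff))
        if abs(ref - diff) > tolerance:
            raise ValueError(f"mass difference {diff:.2f} Da matches no residue")
        bases.append(base)
    return "".join(bases)


# Hypothetical ladder: G, A, T and C removed in turn from a 3000.00 Da fragment.
print(read_ladder([3000.00, 2670.79, 2357.58, 2053.38, 1764.20]))
```

Whether the order of removal corresponds to the 5' to 3' or the 3' to 5'
direction depends on the cleaving agent used, so the caller has to orient the
result accordingly.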

A sequence of a target nucleic acid, for example, can be determined by
subjecting the target nucleic acid to exonuclease digestion for various periods of
time to produce a nested set of deletion fragments containing the target nucleic
acid sequence (see International Publ. WO 94/21822), then analyzing the
nested set of deletion fragments by IR-MALDI. Similarly, a sequence of a target
polypeptide can be determined by subjecting the polypeptide to an
exopeptidase, which can be a carboxypeptidase such as carboxypeptidase Y,
carboxypeptidase P, carboxypeptidase A, carboxypeptidase Q or
carboxypeptidase B; or an aminopeptidase such as alanine aminopeptidase,
leucine aminopeptidase, pyroglutamate peptidase, dipeptidyl peptidase and
microsomal peptidase; or a chemical polypeptide fragmenting agent such as
phenylisothiocyanate, for various periods of time to produce a nested set of
fragments of the biological macromolecule, which can be analyzed by IR-MALDI
mass spectrometry to determine the sequence of the target biological
macromolecule (see, also, Proteins Labfax, pages 273-276 (ed., N.C. Price; BIOS
Scientific Publ., 1996); listing polypeptide fragmenting agents). Exonucleases,
peptidases and exoglycosidases are well known in the art (see, for example,
U.S. Patent No. 6,821,063), as are methods of modifying the activity of such
agents (see, for example, U.S. Patent No. 5,792,664; International Publ.
WO 96/36732).

A sequence of a target biological macromolecule also can be determined
by treating the biological macromolecule with an agent that cleaves the
biological macromolecule unilaterally from a terminus, in a time-limited manner,
and identifying the released monomer subunits by IR-MALDI mass spectrometry.
If desired, degradation of a target biological macromolecule can be performed in
a reactor apparatus (see International Publ. WO 94/21822), in which the
biological macromolecule can be free in composition and the agent that cleaves
can be immobilized, or in which the agent that cleaves can be free in
composition and the biological macromolecule can be immobilized. At time
intervals or as a continuous stream, the reaction mixture containing released
subunits is transported from the reactor for analysis by IR-MALDI mass
spectrometry. Prior to IR-MALDI mass spectrometry analysis, the released
subunits can be transported to a reaction vessel for conditioning, which can be
by mass modification.

A sequence of a target biological macromolecule also can be determined
by generating at least two biological macromolecule fragments from the target
biological macromolecule; preparing a composition containing the biological
macromolecule fragments and a liquid matrix, which absorbs infrared radiation;
and analyzing the biological macromolecule fragments in the composition by
IR-MALDI mass spectrometry, thereby determining the sequence of the target
nucleic acid molecule. In particular, such a process can be useful for
determining the order of subunit sequences within a large biological
macromolecule sequence (see International Publ. WO 98/20019).

A process of determining the subunit sequence of at least one species of
target biological macromolecule also is provided. Such a process can be
performed, for example, by contacting the species of target biological
macromolecule with one or more agents sufficient to cleave each of the bonds
between the monomer subunits in the target biological macromolecule, to
produce a nested set of deletion fragments; preparing a composition containing
at least one biological macromolecule fragment of the set and a liquid matrix,
which absorbs infrared radiation; determining the molecular mass of the at
least one biological macromolecule fragment by IR-MALDI mass spectrometry;
and repeating these steps until the molecular mass of each biological
macromolecule fragment in said set has been determined, thereby determining
the subunit sequence of the species of target biological macromolecule. Such a
process is particularly suitable for multiplex analysis of a plurality of i + 1 species
of target biological macromolecules. For multiplex analysis, each species of
target biological macromolecule can be differentially mass modified such that a
biological macromolecule fragment of each species of target biological
macromolecule can be distinguished from every other biological macromolecule
species by IR-MALDI mass spectrometry.

A process of determining the nucleotide sequence of at least one species
of nucleic acid also is provided. Such a process can be performed by
synthesizing complementary nucleic acids, which are complementary to the
species of nucleic acid to be sequenced, starting from an oligonucleotide primer
and in the presence of chain terminating nucleoside triphosphates, to produce
four sets of base-specifically terminated complementary polynucleotide
fragments; preparing a composition for IR-MALDI that contains the four sets of
polynucleotide fragments and a liquid matrix, which absorbs infrared radiation;
determining the molecular weight value of each polynucleotide fragment by
IR-MALDI mass spectrometry; and determining the nucleotide sequence of the
species of nucleic acid by aligning the molecular weight values according to
molecular weight. The process is particularly suitable to multiplex analysis of a
plurality of i + 1 species of nucleic acids, which can be sequenced concurrently
using i + 1 primers. For multiplex analysis, one of the i + 1 primers is an
unmodified primer or a mass modified primer, and the other i primers are mass
modified primers, such that each of the i + 1 primers can be distinguished from
every other primer by IR-MALDI mass spectrometry.
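
Aligning the molecular weight values of the four base-specifically terminated
sets "according to molecular weight" can be sketched as a merge and sort. The
function name and the example masses below are hypothetical and serve only to
show the idea; real spectra would additionally require peak picking and, for
multiplexing, separating the ladders of differently mass modified primers.

```python
def read_terminated_sets(masses_by_terminator):
    """Order chain-terminated fragments by mass and read the terminating bases.

    masses_by_terminator maps a terminating base ("A", "C", "G" or "T") to the
    measured masses (Da) of the fragments ending in that base. The returned
    string is the synthesized complementary strand, read from the primer
    outwards.
    """
    tagged = [
        (mass, base)
        for base, masses in masses_by_terminator.items()
        for mass in masses
    ]
    tagged.sort()
    return "".join(base for _, base in tagged)


# Hypothetical molecular weight values for the four sets.
example = {
    "A": [5313.2, 6563.0],
    "C": [5602.4],
    "G": [5931.6],
    "T": [6235.8],
}
print(read_terminated_sets(example))
```

The sequence of the template itself is the reverse complement of the string
returned here.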

A sequence of a target nucleic acid also can be determined by
hybridizing at least one partially single stranded target nucleic acid to one or
more nucleic acid probes, each probe containing a double stranded portion, a
single stranded portion, and a determinable variable sequence within the single
stranded portion, to produce at least one hybridized target nucleic acid;
preparing a composition containing the hybridized target nucleic acid and a
liquid matrix, which absorbs infrared radiation; and determining a sequence of
the hybridized target nucleic acid by IR-MALDI mass spectrometry based on the
determinable variable sequence of the probe to which the target nucleic acid
hybridized (U.S. Patent No. 5,503,980). Optionally, a hybridized target nucleic
acid can be ligated to the determinable variable sequence. If desired, the steps
of the process can be repeated a sufficient number of times to determine an
entire sequence of a target nucleic acid. Where a plurality of target nucleic
acids are to be sequenced, the one or more nucleic acid probes can be
immobilized in an array.

IR-MALDI mass spectrometry also can be used to determine a nucleic
acid sequence by analyzing a target polypeptide encoded by the nucleic acid.
Since the mass of a polypeptide is only about 10% of the mass of its encoding
nucleic acid, the translated polypeptide can be more amenable to mass
spectrometric detection. In addition, IR-MALDI mass spectrometry detection of
polypeptides can yield analytical signals of high sensitivity and resolution (see
Berkenkamp et al., Rapid Commun. Mass Spectrom. 11:1399-1406 (1997)).
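
The "about 10%" figure can be checked with a back-of-the-envelope calculation.
The sketch below assumes rule-of-thumb average residue masses of roughly 110 Da
per amino acid and roughly 330 Da per nucleotide of a single strand; neither
value is given in the text, and a double stranded coding region would halve the
ratio.

```python
AVG_AMINO_ACID_RESIDUE_DA = 110.0  # assumed rule-of-thumb average
AVG_NUCLEOTIDE_RESIDUE_DA = 330.0  # assumed rule-of-thumb average, single strand


def polypeptide_to_coding_mass_ratio(n_codons):
    """Mass of a polypeptide relative to the single strand that encodes it."""
    polypeptide_mass = n_codons * AVG_AMINO_ACID_RESIDUE_DA
    coding_mass = 3 * n_codons * AVG_NUCLEOTIDE_RESIDUE_DA  # three nt per codon
    return polypeptide_mass / coding_mass


print(round(polypeptide_to_coding_mass_ratio(300), 3))
```

Because both masses scale linearly with the number of codons, the ratio is
independent of length and comes out at about one ninth, in line with the
approximate figure quoted above.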
Oligonucleotide sizing, fingerprinting and sequencing using IR-MALDI
mass spectrometry and immobilized cleavable primers

IR-MALDI mass spectrometry can also be used, in conjunction with the
immobilized cleavable primers described in U.S. Patent No. 5,830,655 and U.S.
Patent No. 5,700,642 or other such primers, to determine the size of a primer
extension product. In one specific embodiment, a method for determining the
size of a primer extension product is provided. It includes the steps of (a)
hybridizing a primer with a target nucleic acid, where the primer (i) is
complementary to the target nucleic acid; (ii) has a first region containing the 5'
end of the primer, and (iii) has a second region containing the 3' end of the
primer, where the 3' end is capable of serving as a priming site for enzymatic
extension and where the second region contains a selected cleavable site; (b)
extending the primer enzymatically to generate a polynucleotide mixture
containing an extension product composed of the primer and an extension
segment; (c) cleaving the extension product at the cleavable site to release the
extension segment; and (d) sizing the extension segment by IR-MALDI mass
spectrometry with a liquid matrix, whereby the cleaving is effective to increase
the read length of the extension segment relative to the read length of the
product of (b).
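
Why cleaving increases the read length can be seen from simple bookkeeping of
fragment sizes: the released segment no longer carries most of the primer, so a
longer extension fits within the same measurable size range. The sketch below
is illustrative only; the function names and the example numbers (a 20 nt
primer with one primer nucleotide lying 3' of the cleavable site) are
assumptions.

```python
def extension_product_length(primer_length, extension_length):
    """Length (nt) of the uncleaved extension product of step (b)."""
    return primer_length + extension_length


def released_segment_length(primer_length, primer_nt_after_site, extension_length):
    """Length (nt) of the fragment released by cleaving at the cleavable site.

    primer_nt_after_site is the number of primer nucleotides that lie 3' of
    the cleavable site and therefore stay attached to the extension segment.
    """
    if not 0 <= primer_nt_after_site <= primer_length:
        raise ValueError("cleavable site must lie within the primer")
    return primer_nt_after_site + extension_length


# Hypothetical example: 20 nt primer, site one nucleotide from the 3' end,
# 50 nt of extension.
print(extension_product_length(20, 50))    # 70 nt before cleavage
print(released_segment_length(20, 1, 50))  # 51 nt after cleavage
```

In this example 19 of the 70 nucleotides are removed before sizing, so the
same instrument range accommodates 19 additional nucleotides of extension.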

In one embodiment, the target nucleic acid contains an immobilization
attachment site and is thereby immobilized by attachment to a solid support.
The target nucleic acid can be immobilized prior to the extending. Also
preferably, the target nucleic acid is immobilized prior to the cleaving. Further,
more preferably, the product of (b) from the immobilized target nucleic acid is
separated prior to the cleaving step.

In another embodiment, the cleavable site is a nucleotide capable of
blocking 5' to 3' enzyme-promoted digestion, and the cleaving is carried
out by digesting the first region of the primer with an enzyme having a 5' to 3'
exonuclease activity. In another embodiment, the cleavable site is located at or
within about five nucleotides from the 3' end of the primer. More preferably,
the second region of the primer is a single nucleotide that also contains the
cleavable site, such as, but not limited to, a ribonucleotide, dialkoxysilane,
3'-(S)-phosphorothioate, 5'-(S)-phosphorothioate, 3'-(N)-phosphoramidate,
5'-(N)-phosphoramidate, uracil or ribose. The enzyme for extending the
primer in step (b) can be a DNA polymerase.

In yet another embodiment, the extending is carried out in the presence
of a nucleotide containing (i) an immobilization attachment site and (ii) a
releasable site, which is thereby incorporated into the extension segment. More
preferably, a further step of immobilizing the extension segment at the
immobilization attachment site and releasing the extension segment at the
releasable site prior to the sizing by IR-MALDI mass spectrometry is included.

In another specific embodiment, a method for determining the size of a
primer extension product is provided, which method comprises (a) hybridizing a
primer with a target nucleic acid, where the primer (i) is complementary to the
target nucleic acid; (ii) has a first region containing the 5' end of the primer and
an immobilization attachment site, where the immobilization attachment site of
the primer is composed of a series of bases complementary to an intermediary
oligonucleotide, and (iii) has a second region containing the 3' end of the
primer, where the 3' end is capable of serving as a priming site for enzymatic
extension and where the second region contains a selected cleavable site; (b)
extending the primer enzymatically to generate a polynucleotide mixture
containing an extension product composed of the primer and an extension
segment; (c) cleaving the extension product at the cleavable site to release the
extension segment, where prior to the cleaving the primer is immobilized by
specific hybridization of the immobilization attachment site to the intermediary
oligonucleotide bound to a solid support; and (d) sizing the extension segment
by IR-MALDI mass spectrometry with a liquid matrix, whereby the cleaving is
effective to increase the read length of the extension segment relative to the
read length of the product of (b).

In still another specific embodiment, a method for determining the size of
a primer extension product is provided that includes (a) combining first and
second primers with a target nucleic acid, under conditions that promote
hybridization of the primers to the nucleic acid, generating primer/nucleic acid
complexes, where the first primer (i) has a 5' end and a 3' end, (ii) is
complementary to the target nucleic acid, (iii) has a first region containing the 5'
end of the first primer and (iv) has a second region containing the 3' end of the
first primer, where the 3' end is capable of serving as a priming site for
enzymatic extension and where the second region contains a cleavable site, and
where the second primer (i) has a 5' end and a 3' end, (ii) is homologous to the
target nucleic acid, (iii) has a first segment containing the 3' end of the second
primer, and (iv) has a second segment containing the 5' end of the second
primer and an immobilization attachment site; (b) converting the primer/nucleic
acid complexes to double-stranded fragments in the presence of a DNA
polymerase and deoxynucleoside triphosphates; (c) amplifying the
primer-containing fragments by successively repeating the steps of (i)
denaturing the double-stranded fragments to produce single-stranded fragments,
(ii) hybridizing the single-stranded fragments with the first and second primers
to form strand/primer complexes, (iii) generating amplification products from the
strand/primer complexes in the presence of DNA polymerase and
deoxynucleoside triphosphates, and (iv) repeating steps (i) to (iii) until a desired
degree of amplification has been achieved; (d) immobilizing amplification
products containing the second primer via the immobilization attachment site;
(e) removing non-immobilized amplified fragments; (f) cleaving the immobilized
amplification products at the cleavable site, to generate a mixture including a
double-stranded product; (g) denaturing the double-stranded product to release
the extension segment; and (h) sizing the extension segment by IR-MALDI mass
spectrometry with a liquid matrix, whereby the cleaving is effective to increase
the read length of the extension segment relative to the read length of the
amplified strand-primer complexes of (c).

In another embodiment, the method for determining the size of a primer
extension product includes the steps of (a) hybridizing a primer with a target
nucleic acid, where the primer (i) is complementary to the target nucleic acid;
(ii) has a first region containing the 5' end of the primer and an immobilization
attachment site, and (iii) has a second region containing the 3' end of the
primer, where the 3' end is capable of serving as a priming site for enzymatic
extension and where the second region contains a selected cleavable site; (b)
extending the primer enzymatically to generate a polynucleotide mixture
containing an extension product composed of the primer and an extension
segment; (c) cleaving the extension product at the cleavable site to release the
extension segment, where prior to the cleaving the primer is immobilized at the
immobilization attachment site; and (d) sizing the extension segment by
IR-MALDI mass spectrometry with a liquid matrix, whereby the cleaving is
effective to increase the read length of the extension segment relative to the
read length of the product of (b). The enzyme for extending the primer in step
(b) can be a DNA polymerase.

In one embodiment, the cleavable site is located at or within about five
nucleotides from the 3' end of the primer. More preferably, the second region
of the primer is a single nucleotide that also contains the cleavable site, such
as, but not limited to, a ribonucleotide, dialkoxysilane,
3'-(S)-phosphorothioate, 5'-(S)-phosphorothioate, 3'-(N)-phosphoramidate,
5'-(N)-phosphoramidate, or ribose.

In another embodiment, a further step of washing the immobilized
product prior to the cleaving step is included. In another embodiment, the
primer is immobilized on a solid support by attachment at the immobilization
attachment site to an intervening spacer arm bound to the solid support. More
preferably, the intervening spacer arm is six or more atoms in length. The
immobilization attachment site preferably occurs as a substituent on one of the
bases or sugars of the DNA primer. In another embodiment, the immobilization
attachment site is biotin or digoxigenin. In another embodiment, the primer is
immobilized on a solid support, including, but not limited to, glass, silicon,
polystyrene, aluminum, steel, iron, copper, nickel or gold.

In another embodiment, the method for determining the size of a primer
extension product includes the steps of: (a) combining first and second primers
with a target nucleic acid under conditions that promote the hybridization of the
primers to the nucleic acid, thus generating primer/nucleic acid complexes,
where the first primer (i) is complementary to the target nucleic acid; (ii) has a
first region containing the 5' end of the primer and an immobilization attachment
site, and (iii) has a second region containing the 3' end of the primer, where the
3' end is capable of serving as a priming site for enzymatic extension and where
the second region contains a cleavable site, and where the second primer is
homologous to the target nucleic acid; (b) converting the primer/nucleic acid
complexes to double-stranded fragments in the presence of a suitable
polymerase and all four dNTPs; (c) amplifying the primer-containing fragments
by successively repeating the steps of (i) denaturing the double-stranded
fragments to produce single-stranded fragments, (ii) hybridizing the single
strands with the primers to form strand/primer complexes, (iii) generating
double-stranded fragments from the strand/primer complexes in the presence of
DNA polymerase and all four dNTPs, and (iv) repeating steps (i) to (iii) until a
desired degree of amplification has been achieved; (d) denaturing the amplified
fragments to generate a mixture including a product composed of the first
primer and an extension segment; (e) immobilizing amplified fragments
containing the first primer, utilizing the immobilization attachment site, and
removing non-immobilized amplified fragments; (f) cleaving the immobilized
fragments at the cleavable site to release the extension segment; and (g) sizing
the extension segment by IR-MALDI mass spectrometry with a liquid matrix,
whereby the cleaving is effective to increase the read length of the extension
segment relative to the read length of the product of (d).

In another embodiment, a method for determining a single base
fingerprint of a target DNA sequence is provided. The method includes the
steps of (a) hybridizing a primer with a target DNA, where the primer (i) is
complementary to the target DNA; (ii) has a first region containing the 5' end of
the primer and an immobilization attachment site, and (iii) has a second region
containing the 3' end of the primer, where the 3' end is capable of serving as a
priming site for enzymatic extension and where the second region contains a
selected cleavable site; (b) extending the primer with an enzyme in the presence
of a dideoxynucleoside triphosphate corresponding to the single base, to
generate a polynucleotide mixture of primer extension products, each product
containing a primer and an extension segment; (c) cleaving the extension
products at the cleavable site to release the extension segments, where prior to
the cleaving the primers are immobilized at the immobilization attachment sites;
(d) sizing the extension segments by IR-MALDI mass spectrometry with a liquid
matrix, whereby the cleaving is effective to increase the read length of any
given extension segment relative to the read length of its corresponding primer
extension product of (b); and (e) determining the positions of the single base in
the target DNA by comparison of the sizes of the extension segments.
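
Step (e) amounts to converting the measured sizes of the released extension
segments into positions along the extended strand. A minimal sketch follows,
assuming that sizes have already been converted to lengths in nucleotides; the
function names, the example strand and the single retained primer nucleotide
are hypothetical.

```python
def simulate_segment_lengths(extended_strand, base, retained_primer_nt=0):
    """Lengths (nt) of segments terminated at each occurrence of `base`.

    retained_primer_nt is the number of primer nucleotides that remain on the
    segment after cleavage at the cleavable site.
    """
    return [
        index + 1 + retained_primer_nt
        for index, nucleotide in enumerate(extended_strand)
        if nucleotide == base
    ]


def fingerprint_positions(segment_lengths, retained_primer_nt=0):
    """Positions (1-based, counted from the first extended nucleotide) at which
    the chain-terminating base was incorporated."""
    return sorted(length - retained_primer_nt for length in segment_lengths)


lengths = simulate_segment_lengths("GATTACA", "A", retained_primer_nt=1)
print(lengths)
print(fingerprint_positions(lengths, retained_primer_nt=1))
```

The positions refer to the newly synthesized strand; the corresponding
positions in the target carry the complementary base.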

In another embodiment, a method for determining an adenine fingerprint of a
target DNA sequence is provided, which includes (a) hybridizing a primer with a
DNA target, where the primer (i) is complementary to the target DNA; (ii) has a
first region containing the 5' end of the primer and an immobilization attachment
site, and (iii) has a second region containing the 3' end of the primer, where the
3' end is capable of serving as a priming site for enzymatic extension and where
the second region contains a selected cleavable site; (b) extending the primer
with an enzyme in the presence of deoxyadenosine triphosphate (dATP),
deoxythymidine triphosphate (dTTP), deoxycytidine triphosphate (dCTP),
deoxyguanosine triphosphate (dGTP), and deoxyuridine triphosphate (dUTP), to
generate a polynucleotide mixture of primer extension products containing dUTP
at positions corresponding to dATP in the target, each product containing a
primer and an extension segment; (c) treating the primer extension products
with uracil DNA-glycosylase to fragment specifically at dUTP positions to produce
a set of primer extension degradation products; (d) washing the primer extension
degradation products, where prior to the washing, the primer extension
degradation products are immobilized at the immobilization attachment sites,
each immobilized primer extension degradation product containing a primer and
an extension segment, where the washing is effective to remove
non-immobilized species; (e) cleaving the immobilized primer extension
degradation products at the cleavable site to release the extension segments; (f)
sizing the extension segments by IR-MALDI mass spectrometry with a liquid
matrix, whereby the cleaving is effective to increase the read length of any
given extension segment relative to the read length of its corresponding primer
extension degradation product; and (g) determining the positions of adenine in
the target DNA by comparison of the sizes of the released extension segments.

In another specific embodiment, a method for determining the DNA 
sequence of a target DNA sequence is provided, which method comprises (a) 
hybridizing a primer with a target DNA, where the primer (i) is complementary to 
the target DNA; (ii) has a first region containing the 5' end of the primer and an 
immobilization attachment site, and (iii) has a second region containing the 3' 
end of the primer, where the 3' end is capable of serving as a priming site for 
enzymatic extension and where the second region contains a cleavable site, (b) 
extending the primer with an enzyme in the presence of a first of four different 
dideoxy nucleotides to generate a mixture of primer extension products each 
product containing a primer and an extension segment; (c) cleaving at the 
cleavable site to release the extension segments, where prior to the cleaving the 
primers are immobilized at the immobilization attachment sites; (d) sizing the 
extension segments by IR-MALDI mass spectrometry with a liquid matrix, 
whereby the cleaving is effective to increase the read length of the extension 
segment relative to the read length of the product of (b), (e) repeating steps (a) 
through (d) with a second, third, and fourth of the four different dideoxy 
nucleotides, and (f) determining the DNA sequence of the target DNA by 
comparison of the sizes of the extension segments obtained from each of the
four extension reactions.

In yet another specific embodiment, a method for determining the DNA
sequence of a target DNA sequence is provided, which method comprises (a)
hybridizing a primer with a target DNA, where the primer (i) is complementary to
the target DNA; (ii) has a first region containing the 5' end of the primer and an
immobilization attachment site, and (iii) has a second region containing the 3'
end of the primer, where the 3' end is capable of serving as a priming site for
enzymatic extension and where the second region contains a cleavable site; (b)
extending the primer with an enzyme in the presence of a first of four different
deoxynucleoside α-thiotriphosphate analogs (dNTPαS) to generate a mixture of
primer extension products containing phosphorothioate linkages; (c) treating the
primer extension products with a reagent that cleaves specifically at the
phosphorothioate linkages, where the treating is carried out under conditions
producing limited cleavage, resulting in the production of a group of primer
extension degradation products; (d) washing the primer extension degradation
products, where prior to the washing, the primer extension degradation
products are immobilized at the immobilization attachment sites, each
immobilized primer extension degradation product containing a primer and an
extension segment, where the washing is effective to remove non-immobilized
species; (e) cleaving at the cleavable site to release the extension segments; (f)
sizing the extension segments by IR-MALDI mass spectrometry with a liquid
matrix, whereby the cleaving is effective to increase the read length of any
given extension segment relative to the read length of its corresponding primer
extension degradation product; (g) repeating steps (a) through (f) with a second,
third and fourth of the four different dNTPαSs; and (h) determining the DNA
sequence of the target DNA by comparison of the sizes of the extension
segments obtained from each of the four extension reactions. More preferably,
the reagent of step (c) is exonuclease, 2-iodoethanol, or 2,3-epoxy-1-propanol.
DIAGNOSIS AND DETECTION 
Diagnostics 

Using a process as disclosed herein, accurate (at least about
1% accurate) masses of a DNA sample can be obtained for at least about
2000-mer DNA (masses of at least about 650 kDa) and at least about 1200-mer
RNA (masses of at least about 400 kDa; see Example 1). In addition, signals of
single stranded, as well as double stranded, nucleic acids can be obtained in the
spectra (see Figure 3). The improved accuracy for measuring the mass of DNA
by IR-MALDI mass spectrometry (accuracy of at least about 1%) is far superior
to that provided by standard agarose gel sizing of nucleic acids (accuracy of
about 5%). The accuracy of mass determination of RNA by IR-MALDI mass
spectrometry (accuracy of at least about 0.5%) is even more significant, since
an accurate size determination of RNA by gel analysis is difficult, if not
impossible, in part because of the absence of suitable size markers and of a
sufficiently suitable gel matrix.
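
To make the accuracy figures concrete, the following sketch converts a relative
accuracy into an absolute mass window and into an approximate size uncertainty
in residues. It only restates the percentages quoted above; the function names
are hypothetical.

```python
def mass_window(mass_da, relative_accuracy):
    """Return the (low, high) mass range implied by a relative accuracy."""
    delta = mass_da * relative_accuracy
    return mass_da - delta, mass_da + delta


def length_uncertainty(n_residues, relative_accuracy):
    """Approximate size uncertainty, in residues, for a polymer of n_residues."""
    return n_residues * relative_accuracy


print(mass_window(650_000, 0.01))       # 2000-mer DNA at about 1%
print(length_uncertainty(2000, 0.01))   # IR-MALDI, DNA
print(length_uncertainty(2000, 0.05))   # agarose gel sizing
print(length_uncertainty(1200, 0.005))  # IR-MALDI, RNA
```

For a 2000-mer this corresponds to roughly 20 nucleotides at 1% versus roughly
100 nucleotides at the 5% typical of gel sizing, and to roughly 6 nucleotides
for a 1200-mer RNA at 0.5%.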

In addition to the extension in mass range obtained using a process as
disclosed herein, there is a dramatic decrease in the amount of analyte needed
for preparation of the sample for mass spectrometry, down to the low
femtomole (fmol) or attomole (attomol) range, even with an essentially simple
preparation method. Also, by using a liquid matrix rather than a solid matrix,
the ion signals generated are more reproducible from shot to shot. Use of a
liquid matrix also facilitates sample dispensation, for example, onto various
fields of a chip array. Furthermore, by using a liquid matrix in conjunction with
IR-MALDI mass spectrometry, essentially all sample left on the target after
IR-MALDI analysis can be retrieved for further use.
DIAGNOSIS AND DETECTION 
A process of determining the molecular mass of a target biological
macromolecule by IR-MALDI mass spectrometry is provided. Such a process
can be performed, for example, by preparing a composition for IR-MALDI
containing the biological macromolecule to be analyzed and a liquid matrix,
which absorbs infrared radiation; and analyzing the biological macromolecule in
the composition by IR-MALDI mass spectrometry (see Example 1; see, also,
Berkenkamp et al., Rapid Commun. Mass Spectrom. 11:1399-1406 (1997);
Berkenkamp et al., Science 281:260-262 (1998)). The molecular mass of the
target biological macromolecule is determined by running, in parallel or in a
separate spectrum, one or more control biological macromolecules having
known molecular masses, and comparing the spectrum produced by the target
with the spectrum of the control biological macromolecules. A control
biological macromolecule, which can be a corresponding known biological
macromolecule, generally is of the same type of molecule as the target
biological macromolecule, for example, each is a nucleic acid or each is a
polypeptide. However, the control biological macromolecule need not be the
same type of molecule as the target biological macromolecule in order to
determine the molecular mass of the target biological macromolecule (see
Example 1).

IR-MALDI mass spectrometry also can be used for detecting a target
biological macromolecule by preparing a composition containing a biological
macromolecule and a liquid matrix, which absorbs infrared radiation; and
performing IR-MALDI mass spectrometry on the composition to identify the
target biological macromolecule in the composition, thereby detecting the target
biological macromolecule. If desired, the target biological macromolecule can be
present in or isolated from a biological sample. Accordingly, a process for
identifying the presence of a target biological macromolecule in a biological
sample also is provided.

The presence of a target biological macromolecule, for example, a
nucleic acid, in a biological sample can be identified by preparing a composition
for IR-MALDI containing a biological sample containing nucleic acid molecules,
or nucleic acid molecules isolated from the biological sample, and a liquid
matrix, which absorbs infrared radiation; then analyzing the composition by
IR-MALDI mass spectrometry. Detection of a nucleic acid molecule having a
molecular mass of the target nucleic acid sequence identifies the presence of
the target nucleic acid sequence in the biological sample. The molecular mass
of the target biological macromolecule can be determined by comparison to a
control spectrum, or can be determined based on the spectrum produced by a
corresponding known biological macromolecule. Alternatively, a sequence of
the biological macromolecule can be determined, thereby identifying the
presence of the biological macromolecule.
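
As a hedged illustration of detecting a nucleic acid "having a molecular mass
of the target nucleic acid sequence," the sketch below computes the expected
average mass of an unmodified, single-stranded DNA oligonucleotide with free
5'- and 3'-hydroxyl ends and looks for a peak within a tolerance. The residue
masses are commonly used approximate values and are not taken from the
disclosure; the function names, peak list and tolerance are hypothetical.

```python
# Approximate average masses (Da) of nucleotide residues within a DNA chain.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}
# Correction for an oligonucleotide without terminal phosphate groups.
TERMINAL_CORRECTION = -61.96

def oligo_average_mass(sequence):
    """Expected neutral average mass of an unmodified DNA oligonucleotide."""
    return sum(RESIDUE_MASS[base] for base in sequence.upper()) + TERMINAL_CORRECTION

def target_detected(peak_masses, expected_mass, tolerance_da):
    """True if any observed peak lies within tolerance of the expected mass."""
    return any(abs(peak - expected_mass) <= tolerance_da for peak in peak_masses)

expected = oligo_average_mass("ACGT")
# Hypothetical peak list (Da) read from a spectrum.
found = target_detected([980.4, 1174.1, 2210.7], expected, tolerance_da=1.0)
```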

Since the processes disclosed herein allow a characterization of a target
biological macromolecule obtained from a biological sample, IR-MALDI mass
spectrometry can be used to identify an individual having a disease or condition,
or a predisposition to a disease or condition, by detecting a characteristic of a
target biological macromolecule that is associated with the disease or the
condition. Such a process can be performed, for example, by preparing a
composition for IR-MALDI, containing the biological macromolecule, which is
obtained from an individual to be tested, and a liquid matrix, which absorbs
infrared radiation; and analyzing the biological macromolecule, or a relevant
portion of the biological macromolecule, in the composition by IR-MALDI mass
spectrometry. A determination of a particular mass of the target biological
macromolecule identifies the individual as having the disease or condition or a
predisposition to the disease or condition. Such a process is particularly useful
for identifying a genetic disease, or a disease associated with a bacterial
infection, or a predisposition to such a disease, and also is useful for
determining identity, heredity or compatibility. Additional processes disclosed
herein also are useful for such a diagnosis, for example, by determining the
sequence of the target biological macromolecule obtained from the individual or
by comparison of the target biological macromolecule with a corresponding
known biological macromolecule.

The disclosed processes using IR-MALDI are suitable for analyzing more
than one sample of biological macromolecule, particularly a large number of
samples, for example, by depositing a plurality of compositions, each containing
one or more biological macromolecules, on a solid support such as a chip, in the
form of an array, if desired. In addition, the disclosed processes are suitable for
multiplex analysis of a plurality of biological macromolecules contained in one
or a few compositions containing a liquid matrix. Each biological
macromolecule in a plurality can be differentially mass modified, for example, to
facilitate multiplex analysis. Accordingly, the processes are readily adaptable to
high throughput assay formats.
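
The role of differential mass modification in multiplex analysis can be
sketched as a simple check that the expected masses of the analytes in one
composition are far enough apart to be distinguished. The example below is an
assumption-laden illustration only; the analyte names, masses, minimum
separation and size of the mass modification are hypothetical.

```python
from itertools import combinations

def unresolved_pairs(expected_masses, min_separation_da):
    """Return pairs of analyte names whose expected masses are too close."""
    clashes = []
    for (name_a, mass_a), (name_b, mass_b) in combinations(
            sorted(expected_masses.items()), 2):
        if abs(mass_a - mass_b) < min_separation_da:
            clashes.append((name_a, name_b))
    return clashes

# Hypothetical panel in which two analytes are nearly isobaric.
panel = {"probe_1": 6000.0, "probe_2": 6010.0, "probe_3": 6300.0}
before = unresolved_pairs(panel, min_separation_da=30.0)

# Adding a hypothetical 150 Da mass modifier to probe_2 separates the peaks.
modified_panel = dict(panel, probe_2=panel["probe_2"] + 150.0)
after = unresolved_pairs(modified_panel, min_separation_da=30.0)
```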

A biological macromolecule particularly suitable for analysis by a process 
of IR-MALDI can be a nucleic acid, a polypeptide, a carbohydrate, or a 
proteoglycan, or can be a macromolecular complex such as a protein-protein 
complex or a nucleoprotein complex. For analysis, a target biological 
macromolecule can be immobilized to a substrate, particularly a solid support, 
which can be, for example, a bead, a flat surface, a chip, a capillary, a pin, a 
comb, or a wafer, and can be any of various materials, including a metal, a 
ceramic, a plastic, a resin, a gel, and a membrane. For example, the solid 
support can be a silicon wafer or a stainless steel flat surface. Since the 
processes as disclosed herein are particularly useful for analyzing a large 
number of target biological macromolecules in high throughput assays, it can be 
particularly useful to immobilize a plurality of target biological macromolecules in
an array on a solid support. Immobilization can be through a reversible linkage
such as a photocleavable bond or a thiol linkage or a hydrogen bond, and the
linkage can be cleaved using, for example, a chemical process, an enzymatic
process, or a physical process, including during the mass spectrometry analysis
procedure.

Where a target biological macromolecule is a nucleic acid, for example,
the target nucleic acid can be immobilized by hybridization (hydrogen bonding)
between a complementary capture nucleic acid molecule, which is immobilized
to the solid support, and a portion of the nucleic acid molecule containing the
target nucleic acid. It should be recognized, however, that, for some processes
disclosed herein, at least a portion of the sequence containing the target nucleic
acid should be distinct from the hybridizing portion of the target nucleic acid
when immobilization is through hybridization to a capture nucleic acid, for
example, where a detector oligonucleotide is to be hybridized to a sequence of
the target nucleic acid.

Where the target biological macromolecule is a polypeptide, it can be
immobilized to a solid support by binding to a reagent, which is conjugated to
the solid support and specifically interacts with at least a portion of the target
polypeptide or with a tag attached to the target polypeptide. Such a reagent
can be, for example, an antibody that binds an epitope of the target
polypeptide, or can be, for example, nickel ion, which binds to a polyhistidine
sequence tag contained in the target polypeptide. A tag peptide such as a
polyhistidine tag can be incorporated conveniently into a target polypeptide that
is produced, for example, by an in vitro transcription or translation method.

A biological macromolecule to be analyzed can be conditioned prior to
IR-MALDI mass spectrometry analysis. Conditioning improves the ability to
analyze a particular biological macromolecule by IR-MALDI mass spectrometry,
for example, by improving the resolution of the mass spectrum. If desired, the
biological macromolecule can be isolated prior to conditioning or prior to mass
spectrometry analysis.

A target biological macromolecule can be conditioned, for example, by
ion exchange, by contact with an alkylating agent or a trialkylsilyl chloride, or
by incorporating at least one mass modified subunit into the biological
macromolecule. For example, where the biological macromolecule is a nucleic
acid, the target nucleic acid can be conditioned by phosphodiester backbone
modification such as by cation exchange; by incorporating at least one
nucleotide such as an N7-deazapurine nucleotide, an N9-deazapurine nucleotide,
or a 2'-fluoro-2'-deoxynucleotide, each of which can reduce sensitivity of a
nucleic acid to depurination; by incorporation of at least one mass modified
nucleotide; or by hybridization of a tag probe to a portion of a nucleic acid
molecule containing the target nucleic acid (see U.S. Patent No. 5,547,835).

A process for determining the identity of each target biological
macromolecule in a plurality of target biological macromolecules can be
performed, for example, by preparing a composition containing a plurality of
differentially mass modified target biological macromolecules and a liquid matrix,
which absorbs infrared radiation; determining the molecular mass of each
differentially mass modified target biological macromolecule in the plurality by
IR-MALDI mass spectrometry; and comparing the molecular mass of each
differentially mass modified target biological macromolecule in the plurality with
the molecular mass of a corresponding known biological macromolecule or
fragment thereof. Where such a process is performed using a plurality of target
biological macromolecules that are fragments of a biological macromolecule, the
fragments can be prepared by contacting the biological macromolecules with at
least one fragmenting agent that cleaves a bond involved in the formation of the
biological macromolecules, particularly a bond between monomeric subunits of
the biological macromolecule, to produce the fragment target biological
macromolecules.

A target nucleic acid to be analyzed by IR-MALDI mass spectrometry can
be in a biological sample and, if desired, can be amplified prior to analysis, then
analyzed directly by IR-MALDI mass spectrometry. Alternatively, the amplified
nucleic acid molecules can be contacted with a detector oligonucleotide, which
can hybridize to a target nucleic acid sequence present in an amplified nucleic
acid; a composition for IR-MALDI can be prepared by mixing the product of the
reaction with a liquid matrix, which absorbs infrared radiation; and IR-MALDI
mass spectrometry can be performed. Detection of duplex nucleic acid
molecules, which form by hybridization of the detector oligonucleotide and an
amplified target nucleic acid, identifies the presence of the target nucleic acid in
the biological sample.

Amplification of nucleic acid molecules, including a target nucleic acid
molecule, can be performed using well known methods and commercially
available kits. Amplification can utilize a polymerase, which can be a
thermostable polymerase, such as Taq DNA polymerase, AmpliTaq
polymerase, Deep Vent (exo-) DNA polymerase, Vent DNA polymerase,
Vent (exo-) DNA polymerase, Deep Vent DNA polymerase, Thermo Sequenase,
exo(-) Pseudococcus furiosus (Pfu) DNA polymerase, AmpliTaq, Ultman,
9 degree Nm, Tth, Hot Tub, Pyrococcus furiosus (Pfu) and Pyrococcus woesei
(Pwo) DNA polymerase. Amplification processes include the polymerase chain
reaction (Newton and Graham, PCR (BIOS Publ. 1994)); nucleic acid sequence
based amplification; transcription-based amplification system; self-sustained
sequence replication; Q-beta replicase based amplification; ligation
amplification reaction; ligase chain reaction (Wiedmann et al., PCR Meth. Appl.
3:57-64 (1994); Barany, Proc. Natl. Acad. Sci., USA 88:189-93 (1991)); strand
displacement amplification (Walker et al., Nucl. Acids Res. 22:2670-77 (1994));
and variations of these methods, including, for example, reverse transcription
PCR (RT-PCR; Higuchi et al., Bio/Technology 11:1026-1030 (1993)), and
allele-specific amplification.

Where a nucleotide sequence of the target nucleic acid is amplified by
PCR, well known reaction conditions are used. The minimal components of an
amplification reaction include a template DNA molecule; a forward primer and a
reverse primer, each of which is capable of hybridizing to the template DNA
molecule or a nucleotide sequence linked thereto; each of the
nucleoside triphosphates or appropriate analogs thereof; an agent for
polymerization such as DNA polymerase; and a buffer having the appropriate
pH, ionic strength, cofactors, and the like. Generally, about 26 to 30
amplification cycles, each including a denaturation step, an annealing step and
an extension step, are performed, but fewer cycles can be sufficient or more
cycles can be required depending, for example, on the amount of the template
DNA molecules present in the reaction. Examples of PCR reaction conditions
are described in U.S. Patent No. 5,604,099.
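
The dependence of the number of cycles on the amount of template can be
illustrated with the idealized exponential model of PCR, in which each cycle
multiplies the copy number by (1 + efficiency). This is a simplified sketch
that ignores the plateau phase; the function names, copy numbers and
efficiency values are hypothetical and are not taken from the disclosure.

```python
import math

def pcr_copies(initial_copies, cycles, efficiency=1.0):
    """Idealized product copy number after a given number of cycles."""
    if not 0.0 < efficiency <= 1.0:
        raise ValueError("efficiency must be in the interval (0, 1]")
    return initial_copies * (1.0 + efficiency) ** cycles

def cycles_needed(initial_copies, target_copies, efficiency=1.0):
    """Smallest whole number of cycles reaching the target copy number."""
    if not 0.0 < efficiency <= 1.0:
        raise ValueError("efficiency must be in the interval (0, 1]")
    ratio = target_copies / initial_copies
    return math.ceil(math.log(ratio) / math.log(1.0 + efficiency))

# A single template copy after 30 perfectly efficient cycles.
copies_after_30 = pcr_copies(1, 30)

# More abundant template needs fewer cycles to reach the same amount.
cycles_from_1000 = cycles_needed(1_000, 1e9)
cycles_from_10 = cycles_needed(10, 1e9)
```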

A nucleic acid sequence can be amplified using PCR as described in U.S.
Patent No. 5,545,539, which provides an improvement of the basic procedure
for amplifying a target nucleotide sequence by including an effective amount of
a glycine-based osmolyte in the amplification reaction mixture. The use of a
glycine-based osmolyte improves amplification of sequences rich in G and C
residues and, therefore, can be useful, for example, to amplify trinucleotide
repeat sequences such as those associated with Fragile X syndrome (CGG
repeats) and myotonic dystrophy (CTG repeats).

The presence of a target nucleic acid sequence in a biological sample
can be detected by specifically digesting nucleic acid molecules, which can
be amplified nucleic acid molecules, containing a target nucleic acid sequence
using at least one appropriate nuclease; hybridizing the digested nucleic acid
fragments with complementary capture nucleic acid sequences, which are
immobilized on a solid support and can hybridize to a digested fragment of a
target nucleic acid; preparing a composition for IR-MALDI, containing the
immobilized fragments and a liquid matrix, which absorbs infrared radiation;
and analyzing the fragments by IR-MALDI mass spectrometry (see International
Publ. Nos. WO 96/29431 and WO 98/20019). The detection of nucleic acid
fragments that were immobilized by hybridization to complementary capture
nucleic acid sequences identifies the presence of the target nucleic acid
sequence in the biological sample. Immobilization of the nucleic acid fragments
can be reversed prior to performing IR-MALDI or as a consequence of IR-MALDI
mass spectrometry, for example, due to cleavage of an IR cleavable linkage
during IR-MALDI.

The presence of a target nucleic acid in a biological sample also can be
identified by performing on nucleic acid molecules obtained from the biological
sample a first polymerase chain reaction using a first set of primers, which are
capable of amplifying a portion of the nucleic acid containing the target nucleic
acid; preparing a composition containing the first amplification product and a
liquid matrix, which absorbs infrared radiation; and detecting the first
amplification product in the composition by IR-MALDI mass spectrometry,
thereby detecting the presence of the target nucleic acid in the biological
sample. Such a process can include, prior to performing IR-MALDI, a second
polymerase chain reaction on the first amplification product using a second set
of primers, which are capable of amplifying at least a portion of the first
amplification product containing the target nucleic acid (International Publ.
WO 98/20019).

Processes for determining the identity of a subunit in a biological
macromolecule, for example, for detecting a mutation in a nucleotide sequence,
also are provided. The identity of a target nucleotide can be determined by
hybridizing a nucleic acid molecule containing the target nucleotide with a
primer oligonucleotide that is complementary to the nucleic acid molecule at a
site adjacent to the target nucleotide; contacting the hybridized nucleic acid
molecule with a complete set of dideoxynucleosides or 3'-deoxynucleoside
triphosphates and a DNA dependent DNA polymerase, so that only the
dideoxynucleoside or 3'-deoxynucleoside triphosphate that is complementary to
the target nucleotide is extended onto the primer; preparing a composition
containing the extended primer and a liquid matrix, which absorbs infrared
radiation; and detecting the extended primer in the composition by IR-MALDI
mass spectrometry. The identity of the target nucleotide is determined based
on the dideoxynucleoside or 3'-deoxynucleoside triphosphate present in the
extended primer, as determined by IR-MALDI mass spectrometry.
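
The way in which the mass of the extended primer reveals the target nucleotide
can be sketched as follows. The example assumes extension by a single
dideoxynucleotide and uses approximate average mass increments for the four
dideoxynucleotide monophosphate residues; these values, the tolerance and the
function names are illustrative assumptions and are not taken from the
disclosure.

```python
# Approximate average mass (Da) added to a primer by one dideoxynucleotide.
DDNMP_MASS = {"A": 297.21, "C": 273.18, "G": 313.21, "T": 288.20}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

def call_extended_base(primer_mass, extended_mass, tolerance_da=1.0):
    """Return the added base if exactly one increment matches, else None."""
    shift = extended_mass - primer_mass
    matches = [base for base, mass in DDNMP_MASS.items()
               if abs(shift - mass) <= tolerance_da]
    return matches[0] if len(matches) == 1 else None

def target_nucleotide(extended_base):
    """The template nucleotide is the complement of the base added."""
    return COMPLEMENT[extended_base]

# Hypothetical masses (Da) of the unextended and extended primer peaks.
added = call_extended_base(5000.0, 5288.2)
template_base = target_nucleotide(added)
```

The smallest difference between any two of the assumed increments is about
9 Da (A versus T), which indicates the mass accuracy such a call would require.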

The absence or presence of a mutation in a target nucleic acid sequence
also can be determined by hybridizing a nucleic acid molecule containing the
target nucleic acid sequence with at least one primer, which has 3' terminal
base complementarity to the target nucleic acid sequence; contacting the
hybridized nucleic acid with an appropriate polymerase enzyme and sequentially
with one of the four nucleoside triphosphates; preparing a composition
containing the reaction product and a liquid matrix, which absorbs infrared
radiation; and detecting the product in the composition by IR-MALDI mass
spectrometry. Based on the molecular weight of the product, the presence or
absence of a mutation next to the 3' end of the primer in the target nucleic acid
molecule can be determined (International PCT application No. WO 98/20019).

A mutation in a target nucleic acid molecule also can be detected by
hybridizing the target nucleic acid molecule with an oligonucleotide probe, to
produce a hybridized nucleic acid, wherein a mismatch is formed at the site of a
mutation; contacting the hybridized nucleic acid with a single strand specific
endonuclease; preparing a composition containing the reaction product and a
liquid matrix, which absorbs infrared radiation; and analyzing the composition by
IR-MALDI mass spectrometry. The oligonucleotide probe used in this process
has the sequence expected in a normal (unmutated) nucleic acid sequence
corresponding to the target nucleic acid. The detection by IR-MALDI mass
spectrometry of more than one nucleic acid fragment in the composition
indicates that a mismatch was present in the hybridization product formed
between the target nucleic acid and the oligonucleotide probe and, therefore,
that the target nucleic acid molecule contains a mutation (International Publ.
WO 98/20019).

The absence or presence of a mutation in a target nucleic acid sequence 
also can be identified by performing at least one hybridization of a nucleic acid 
molecule containing the target nucleic acid sequence with a set of ligation 
educts and a DNA ligase; preparing a composition for IR-MALDI containing the 
reaction product and a liquid matrix, which absorbs infrared radiation; and 
analyzing the composition by IR-MALDI mass spectrometry. Using such a 
process, the detection of a ligation product in the composition identifies the 
absence of a mutation in the target nucleic acid sequence, whereas the 
detection only of the set of ligation educts in the composition identifies the 
presence of a mutation in the target nucleic acid sequence.
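
The interpretation rule just described (ligation product detected, no mutation;
only the ligation educts detected, mutation) can be written as a small decision
function. The sketch below assumes exactly two ligation educts and assumes
that joining them releases one water molecule, so that the product mass is the
sum of the educt masses less about 18.02 Da; the function names, masses and
tolerance are hypothetical.

```python
WATER_MASS = 18.02  # approximate average mass (Da) lost on ligation

def has_peak(peak_masses, mass, tolerance_da):
    """True if an observed peak lies within tolerance of the given mass."""
    return any(abs(peak - mass) <= tolerance_da for peak in peak_masses)

def interpret_ligation(peak_masses, educt_a, educt_b, tolerance_da=2.0):
    """Classify a spectrum from a two-educt ligation assay."""
    product = educt_a + educt_b - WATER_MASS
    if has_peak(peak_masses, product, tolerance_da):
        return "no mutation detected"
    if (has_peak(peak_masses, educt_a, tolerance_da)
            and has_peak(peak_masses, educt_b, tolerance_da)):
        return "mutation present"
    return "inconclusive"

# Hypothetical spectra for educts of 3000.0 Da and 3500.0 Da.
ligated = interpret_ligation([3000.1, 3500.2, 6482.0], 3000.0, 3500.0)
unligated = interpret_ligation([3000.1, 3500.2], 3000.0, 3500.0)
```

The "inconclusive" outcome is an addition of this sketch for spectra in which
neither the product nor both educts are observed.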

A process of detecting the presence of ligation product by IR-MALDI
mass spectrometry, as disclosed above, also can detect the presence of a target
nucleic acid by performing at least one hybridization on a nucleic acid molecule
containing the target nucleic acid with a set of ligation educts and a
thermostable DNA ligase; preparing a composition containing the reaction
product and a liquid matrix, which absorbs infrared radiation; and identifying a
ligation product in the composition by IR-MALDI mass spectrometry. The
formation of a ligation product indicates the presence of the target nucleic acid.

A process as disclosed herein also provides a means of using IR-MALDI
mass spectrometry to determine the identity of a target polypeptide by
comparing the masses of defined peptide fragments of the target polypeptide
with the masses of corresponding peptide fragments of a corresponding known
polypeptide. Such a process can be performed, for example, by obtaining the
target polypeptide by in vitro translation, or by in vitro transcription followed by
translation of a nucleic acid encoding the target polypeptide; contacting the
translated polypeptide with at least one fragmenting agent that cleaves at least
one peptide bond in the polypeptide; preparing a composition for IR-MALDI
containing the peptide fragments and a liquid matrix, which absorbs IR
radiation; determining the molecular mass of at least one of the peptide
fragments by IR-MALDI mass spectrometry; and comparing the molecular mass
of the peptide fragments with the molecular mass of peptide fragments of a
corresponding known polypeptide. The masses of the peptide fragments of a
corresponding known polypeptide either can be determined in a parallel reaction
with the target polypeptide, wherein the corresponding known polypeptide also
is contacted with the agent; can be compared with known masses for peptide
fragments of a corresponding known polypeptide contacted with the particular
cleaving agent; or can be obtained from a database of polypeptide sequence
information using algorithms that determine the molecular mass of peptide
fragments of a polypeptide. Such a process is particularly useful, for example,
for identifying mutations and, therefore, for screening for certain genetic
disorders, for example, a single base mutation that introduces a STOP codon
into an open reading frame of a gene, since such a mutation results in
premature protein truncation; or a change in the encoded amino acid in an allelic
variant of a polymorphic gene, for example, a single base change that results in
an amino acid change of alanine to glycine, since polypeptides containing the
different amino acids can be distinguished based on their masses.
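
The kind of algorithm that determines the molecular mass of a peptide fragment
from sequence information can be sketched briefly. The example below sums
standard average residue masses and adds one water molecule for the free
termini; it assumes unmodified peptides, and the function name and the example
sequences are hypothetical.

```python
# Standard average residue masses (Da) of the twenty common amino acids.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594,
    "N": 114.1038, "D": 115.0886, "Q": 128.1307, "K": 128.1741,
    "E": 129.1155, "M": 131.1926, "H": 137.1411, "F": 147.1766,
    "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # accounts for the free amino and carboxyl termini

def peptide_average_mass(sequence):
    """Average mass of an unmodified peptide given in one-letter code."""
    return sum(RESIDUE_MASS[residue] for residue in sequence.upper()) + WATER_MASS

# Hypothetical fragment carrying alanine, and its glycine variant.
known_fragment = peptide_average_mass("LKAE")
variant_fragment = peptide_average_mass("LKGE")
ala_to_gly_shift = known_fragment - variant_fragment

# A premature STOP codon yields a shorter, and therefore lighter, product.
truncated_fragment = peptide_average_mass("LK")
```

In this sketch the alanine to glycine change shifts the fragment mass by about
14 Da, whereas truncation removes the mass of every residue after the STOP
position.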

A process of using IR-MALDI to analyze a target polypeptide to obtain
information regarding the encoding nucleic acid can be used for identifying the
presence of nucleotide repeats, particularly an abnormal number of nucleotide
repeats, by determining the identity of a target polypeptide encoded by such
repeats. An abnormal number of nucleotide repeats can be identified by using
IR-MALDI mass spectrometry to compare the mass of a target polypeptide with
that of a corresponding known polypeptide.
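
As a hedged example of such a comparison, suppose the repeats are in-frame
trinucleotide repeats that each encode one glutamine residue (for example,
CAG repeats); this is an assumption made for illustration and is not stated in
the paragraph above. Each additional repeat then adds the average mass of one
glutamine residue, about 128.13 Da, to the polypeptide. The function name and
masses are hypothetical.

```python
GLN_RESIDUE_MASS = 128.1307  # average mass (Da) of one glutamine residue

def repeat_difference(target_mass, known_mass, residue_mass=GLN_RESIDUE_MASS):
    """Estimate how many more (or fewer) repeats the target encodes."""
    return round((target_mass - known_mass) / residue_mass)

# Hypothetical polypeptide masses (Da) for a known allele and a target.
known_mass = 12000.0
target_mass = known_mass + 20 * GLN_RESIDUE_MASS
extra_repeats = repeat_difference(target_mass, known_mass)
```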

A target polypeptide can be obtained by translating an RNA molecule
encoding the target polypeptide in vitro. If desired, the RNA molecule can be
obtained by in vitro transcription of a nucleic acid encoding the target
polypeptide. Translation of a target polypeptide can be effected by directly
introducing an RNA molecule encoding the polypeptide into an in vitro
translation reaction or by introducing a DNA molecule encoding the polypeptide
into an in vitro transcription/translation reaction or into an in vitro transcription
reaction, then transferring the RNA to an in vitro translation reaction.

In vitro transcription and in vitro translation kits are well known in the art
and commercially available. In vitro translation systems include eukaryotic cell
lysates such as rabbit reticulocyte lysates, rabbit oocyte lysates, human cell
lysates, insect cell lysates and wheat germ extracts. Such lysates and extracts
can be prepared or are commercially available (Promega Corp.; Stratagene,
La Jolla CA; Amersham, Arlington Heights IL; and GIBCO/BRL, Grand Island
NY). In vitro translation systems generally contain macromolecules such as
enzymes; translation, initiation and elongation factors; chemical reagents; and
ribosomes. Mixtures of purified translation factors, as well as combinations of
lysates or lysates supplemented with purified translation factors such as
initiation factor-1 (IF-1), IF-2, IF-3 (alpha or beta), elongation factor T (EF-Tu) or
termination factors, also can be used for mRNA translation in vitro. If desired,
incubation can be performed in a continuous manner, whereby reagents are
flowed into the system and nascent polypeptides removed or left to accumulate,
using a continuous flow system as described by Spirin et al. (Science
242:1162-64 (1988)). Such a process can be desirable for large scale
production of nascent polypeptides.

An in vitro translation reaction using a reticulocyte lysate, for example,
can be carried out by mixing ten μl of a reticulocyte lysate with spermidine,
creatine phosphate, amino acids, HEPES buffer (pH 7.4), KCl, MgAc and the
RNA to be translated, and incubating for an appropriate time, generally about
one hour at 30°C. The optimum amount of MgAc for obtaining efficient
translation varies from one reticulocyte lysate preparation to another and can be
determined using a standard preparation of RNA and a concentration of MgAc
up to about 1 mM. The optimal concentration of KCl also can vary depending
on the specific reaction. For example, 70 mM KCl generally is optimal for
translation of capped RNA, whereas 40 mM generally is optimal for translation
of uncapped RNA.

A wheat germ extract can be prepared as described by Roberts and
Paterson (Proc. Natl. Acad. Sci., USA 70:2330-2334 (1973)) and can be
modified as described by Anderson (Meth. Enzymol. 101:635 (1983)), if
desired. The protocol also can be modified according to manufacturing protocol
L418 (Promega Corp.). Generally, wheat germ extract is prepared by grinding
wheat germ in an extraction buffer, followed by centrifugation to remove cell
debris. The supernatant is separated by chromatography from endogenous
amino acids and from plant pigments that are inhibitory to translation. The
extract also is treated with micrococcal nuclease to destroy endogenous mRNA,
thereby reducing background translation to a minimum. The wheat germ
extract contains the cellular components necessary for protein synthesis,
including tRNA, rRNA and initiation, elongation and termination factors. The
extract can be optimized further by adding an energy generating system
such as phosphocreatine kinase and phosphocreatine; MgAc is added at a level
recommended for the translation of most mRNA species, generally about 6.0 to
7.5 mM magnesium (see, also, Erickson and Blobel, Meth. Enzymol. 96:38
(1982)), and can be modified, for example, by adjusting the final ion
concentrations to 2.6 mM magnesium and 140 mM potassium, and the
composition to pH 7.5 (U.S. Patent No. 4,983,521). Translation in wheat germ
extract also can be performed as described in U.S. Patent No. 5,492,817.

For determining the optimal in vitro translation conditions or the extent of
the reaction, translation of mRNA in an in vitro system can be monitored, for
example, by mass spectrometry analysis. Monitoring also can be performed,
for example, by adding one or more radioactive amino acids such as
35S-methionine and measuring incorporation of the radiolabel into the translation
products by precipitating the proteins in the lysate such as with TCA and
counting the amount of radioactivity present in the precipitate at various times
during incubation. The translation products also can be analyzed by
immunoprecipitation or by SDS-polyacrylamide gel electrophoresis (see, for
example, Sambrook et al., Molecular Cloning: A laboratory manual (Cold Spring
Harbor Laboratory Press 1989); Harlow and Lane, Antibodies: A laboratory
manual (Cold Spring Harbor Laboratory Press 1988)). A labeled non-radioactive
amino acid also can be incorporated into a nascent polypeptide. For example,
the translation reaction can contain a mis-aminoacylated tRNA (U.S. Patent
No. 5,643,722). A non-radioactive marker can be mis-aminoacylated to a tRNA
molecule and the tRNA amino acid complex is added to the translation system.
The system is incubated to incorporate the non-radioactive marker into the
nascent polypeptide and polypeptides containing the marker can be detected
using a detection method appropriate for the marker. Mis-aminoacylation of a
tRNA molecule also can be used to add a marker to the polypeptide in order to
facilitate isolation of the polypeptide. Such markers include, for example, biotin,
streptavidin and derivatives thereof (U.S. Patent No. 5,643,722).

In vitro transcription and translation reactions also can be performed
simultaneously using, for example, a commercially available system such as the
Coupled Transcription/Translation System (Promega Corp., catalog # L4606,
# 4610 or # 4950). Coupled transcription and translation systems using RNA
polymerases and eukaryotic lysates are described in U.S. Patent No. 5,324,637.
Coupled in vitro transcription and translation also can be carried out using a
prokaryotic system such as a bacterial system, for example, E. coli S30 cell-free
extracts (Zubay, Ann. Rev. Genet. 7:267 (1973)).

A target polypeptide also can be obtained from a host cell transformed 
with and expressing a nucleic acid encoding the target polypeptide. The nucleic
acid encoding the target polypeptide can be amplified, for example, by PCR,
inserted into an expression vector, and the expression vector introduced into a 
host cell suitable for expressing the polypeptide encoded by the target nucleic 
acid. Host cells can be eukaryotic cells, particularly mammalian cells such as 
human cells, or prokaryotic cells, including, for example, E. coli. Eukaryotic and 
prokaryotic expression vectors are well known in the art and can be obtained 
from commercial sources. Following expression in the host cell, the target 
polypeptide can be isolated using methods as disclosed herein. For example, if 
the target polypeptide is fused to a polyhistidine tag peptide, the target 
polypeptide can be purified by affinity chromatography on a chelated nickel ion 
column. 

A target polypeptide can be produced from an amplified nucleic acid 
encoding the target polypeptide. Where a target polypeptide is produced, for 
example, from an amplified nucleic acid, it can be useful to operably link one or 
more transcription or translation regulatory elements to the nucleic acid or 
encoded polypeptide. Thus, a forward or reverse PCR primer can contain, if 
desired, a nucleotide sequence of a promoter, for example, a bacteriophage 
promoter such as an SP6, T3 or T7 promoter. Amplification of a nucleic acid
sequence using such a primer produces an amplified nucleic acid operably linked
to the promoter, i.e., the promoter is situated in the amplified nucleic acid such 
that it performs the function of a promoter. Such a nucleic acid can be used in 
an in vitro transcription reaction to transcribe the amplified target nucleic acid 
sequence. 

A primer, for example, the forward primer, also can contain regulatory 
sequence elements necessary for translation of an RNA in a prokaryotic or 
eukaryotic system. In particular, where it is desirable to perform a translation 
reaction in a prokaryotic translation system, a primer can contain an operably 
linked prokaryotic ribosome binding sequence (Shine-Dalgarno sequence), which 
is located downstream of a promoter sequence and about 5 to 10 nucleotides 
upstream of the initiation codon.

A primer also can contain an initiation (ATG) codon, or complement
thereof, as appropriate, located downstream of a promoter, if present, such that
amplification of the target nucleic acid results in an amplified target sequence
containing an operably linked ATG codon, which is in frame with the desired
reading frame. The reading frame can be the natural reading frame or can be
any other reading frame. Where the target polypeptide is not a naturally
occurring polypeptide, operably linking an initiation codon to the nucleic acid
encoding the target polypeptide allows translation of the target polypeptide in
the desired reading frame.

A primer, generally the reverse primer, also can contain a sequence
encoding a STOP codon in one or more of the reading frames, to assure proper
termination of the target polypeptide. Further, by incorporating into the reverse
primer sequences encoding three STOP codons, one into each of the three
possible reading frames, optionally separated by several residues, additional
mutations that occur downstream (3') of a mutation that otherwise results in
premature termination of a polypeptide can be detected.
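
The requirement that a STOP codon be present in each of the three reading
frames can be checked computationally when designing such a primer. The
following sketch examines the sense-strand version of a candidate STOP
segment and derives the corresponding reverse primer segment as its reverse
complement; the segment shown and the function names are hypothetical, and
frames are numbered relative to the first nucleotide of the segment.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

def frames_with_stop(sense_sequence):
    """Return the set of reading frames (0, 1, 2) that contain a STOP codon."""
    sequence = sense_sequence.upper()
    frames = set()
    for frame in range(3):
        for start in range(frame, len(sequence) - 2, 3):
            if sequence[start:start + 3] in STOP_CODONS:
                frames.add(frame)
                break
    return frames

def reverse_complement(sequence):
    """Sequence of the opposite strand, written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(sequence.upper()))

# Hypothetical sense-strand segment: three TAA codons offset by one residue.
stop_segment = "TAAATAAATAA"
covered_frames = frames_with_stop(stop_segment)

# Portion of the reverse primer that would encode this segment.
reverse_primer_segment = reverse_complement(stop_segment)
```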

A forward or reverse primer also can contain a nucleotide sequence, or
the complement of a nucleotide sequence (if present in the reverse primer),
encoding a second polypeptide. The second polypeptide can be a tag peptide,
which interacts specifically with a particular reagent, for example, an antibody.
A second polypeptide also can have an unblocked and reactive amino terminus
or carboxyl terminus.

The fusion of a tag peptide to a target polypeptide or other polypeptide 
of interest allows the detection and isolation of the polypeptide. A target 
polypeptide encoded by a nucleic acid linked in frame to a sequence encoding a 
tag peptide can be isolated from an in vitro translation reaction mixture using a 
reagent that interacts specifically with the tag peptide, then the isolated target
polypeptide can be subjected to IR-MALDI mass spectrometry, as disclosed 
herein. It should be recognized that an isolated target polypeptide fused to a 
tag peptide or other second polypeptide is in a sufficiently purified form to allow 
IR-MALDI mass spectrometry analysis, since the mass of the tag peptide will be 
known and can be considered in the determination.

Numerous tag peptides and the nucleic acid sequences encoding such
tag peptides, which aid in isolation of anything linked thereto, generally
contained in a plasmid, are known and are commercially available (NOVAGEN).
Any peptide can be used as a tag, provided a reagent such as an antibody that 
interacts specifically with the tag peptide is available or can be prepared and 
identified. Frequently used tag peptides include a myc epitope, which includes 
a 10 amino acid sequence from c-myc (see Ellison et al., J. Biol. Chem.
266:21150-21157 (1991)); the pFLAG system (International Biotechnologies,
Inc.); the pEZZ-protein A system (Pharmacia); a 16 amino acid peptide portion 
of the Haemophilus influenza hemagglutinin protein; a GST polypeptide; and a 
polyhistidine peptide, which generally contains about four to twelve or more 
contiguous His residues, for example, His-6, which contains six His residues. 
Reagents that interact specifically with a tag peptide also are known in the art 
and are commercially available and include antibodies and various other 
molecules, depending on the tag, for example, metal ions such as nickel or 
cobalt ions, which interact specifically with a His-6 peptide; or glutathione, 
which can be conjugated to a solid support such as agarose and interacts 
specifically with GST. 

A second polypeptide also can be designed to serve as a mass modifier 
of the target polypeptide encoded by the target nucleic acid. Accordingly, a 
target polypeptide can be mass modified by translating an RNA encoding the 
target polypeptide operably linked to a mass modifying amino acid sequence, 
where the mass modifying sequence can be at the amino terminus or the 
carboxyl terminus of the fusion polypeptide. Modification of the mass of the 
polypeptide derived from such a recombinant nucleic acid is useful, for example, 
when several polypeptides are analyzed in a single IR-MALDI mass 
spectrometric analysis, since mass modification can increase resolution of a 
mass spectrum and allow for analysis of two or more different target 
polypeptides by multiplexing. 

Tagged peptides 

Polypeptides can be modified by addition of a peptide or polypeptide 
fragment to the target polypeptide. For example, a target polypeptide can be
modified by translating the target polypeptide to include additional amino acids,
such as polyhistidine, polylysine or polyarginine. These modifications serve to aid
in purification, identification, and immobilization (and also in IR mass
spectrometry). Modifications can be added post-translationally or can be
encoded by a recombinant nucleic acid containing a sequence of nucleotides that
encode the target polypeptide.

Where a plurality of target polypeptides is to be differentially mass
modified, each target polypeptide in the plurality can be mass modified, for
example, using a different polyhistidine sequence, for example, His-4, His-5,
His-6, and so on. The use of such a mass modifying moiety provides the
further advantage that the moiety acts as a tag peptide, which can be useful,
for example, for isolating the target polypeptide attached thereto. Accordingly,
the disclosed processes permit multiplexing to be performed on a plurality of
polypeptides and, therefore, are useful for determining the amino acid
sequences of each of a plurality of polypeptides, particularly a plurality of target
polypeptides.
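
As a rough illustration of differential mass modification with polyhistidine sequences of different lengths, the sketch below adds the mass of a His-n tag to each of several polypeptides of nearly identical mass and compares the peak spacing before and after tagging. The base masses are hypothetical, the per-residue value is an approximate average mass for a histidine residue, and the function names are assumptions made for this example.

```python
# Approximate average mass of one histidine residue (C6H7N3O), in daltons.
HIS_RESIDUE_MASS = 137.14

def tagged_masses(base_masses, tag_lengths):
    """Add the mass of a His-n tag to each base polypeptide mass."""
    return [m + n * HIS_RESIDUE_MASS for m, n in zip(base_masses, tag_lengths)]

def min_spacing(masses):
    """Smallest gap between neighbouring masses once they are sorted."""
    ordered = sorted(masses)
    return min(b - a for a, b in zip(ordered, ordered[1:]))

# Hypothetical example: three target polypeptides of nearly identical mass (Da).
base = [5000.0, 5003.0, 5001.0]
tagged = tagged_masses(base, [4, 5, 6])  # His-4, His-5 and His-6 tags

print("spacing without tags:", min_spacing(base))
print("spacing with tags:   ", round(min_spacing(tagged), 2))
```

Because each polypeptide receives a tag of a different length, neighbouring signals end up separated by roughly one histidine residue or more, which is what makes the multiplexed spectrum easier to resolve.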

Primers for amplification can be selected such that the amplification
reaction produces a nucleic acid that, upon transcription and translation, results
in a non-naturally occurring polypeptide, for example, a polypeptide encoded by
an open reading frame that is not a reading frame encoding a naturally occurring
polypeptide. Accordingly, by appropriate primer design, in particular by
including an initiation codon in the desired reading frame and, if present,
downstream of a promoter in the primer, a polypeptide produced from a target
nucleic acid can be encoded by one of the two non-coding frames of the nucleic
acid. Such a method can be used to shift out of frame STOP codons, which
prematurely truncate a protein and exclude relevant amino acids, or to make a
polypeptide containing an amino acid repeat more soluble. Primers useful for
effecting the modifications disclosed herein can be obtained from commercial
sources or can be synthesized using, for example, the phosphotriester method
(see Narang et al., Meth. Enzymol. 68:90 (1979); U.S. Patent No. 4,356,270;
see, also, U.S. Patent Nos. 5,547,835; 5,605,798; and 5,622,824).

A non-naturally occurring target polypeptide also can be encoded by a 5'
or 3' non-coding region of an exonic region of a nucleic acid; by an intron; or by
a regulatory element such as a promoter sequence that contains, in one of the
six frames (3 frames per strand), at least a portion of an open reading frame. In
these situations, one primer for amplification of the target nucleic acid contains 
a promoter and an initiation codon, such that the amplified nucleic acid can be 
transcribed and translated in vitro. Thus, a method for determining the identity 
of a target polypeptide, as disclosed herein, permits the determination of the 
identity of a nucleotide sequence located in any region of a chromosome, 
provided a polypeptide of at least 2 amino acids, generally at least 3 or 4 amino 
acids, particularly at least 5 amino acids, is encoded by one of the six frames of 
the polynucleotide. Accordingly, a process as disclosed herein can be used to 
determine a nucleotide sequence of an unknown nucleic acid directly, or 
indirectly by comparing the amino acid sequence of a polypeptide encoded by 
the unknown nucleic acid with the amino acid sequence of a polypeptide 
encoded by a corresponding known nucleic acid. Where the nucleotide 
sequence is determined based on the amino acid sequence of an unknown 
polypeptide, the determined nucleotide sequence of the unknown polynucleotide 
can be the same as a naturally occurring nucleotide sequence encoding the 
polypeptide, or can be different from the naturally occurring sequence due to 
degeneracy of the genetic code. 
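
The notion of a polypeptide encoded by any one of the six frames (3 frames per strand) can be made concrete with a minimal sketch that translates a sequence in all six frames using the standard genetic code and lists the stretches between STOP codons. The function names and the example sequence are assumptions chosen for illustration and are not part of the disclosed processes.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

def translate(seq):
    """Translate complete codons of seq; '*' marks a STOP codon."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))

def six_frame_translations(seq):
    """Map (strand, offset) to the translation of that reading frame."""
    seq = seq.upper()
    frames = {}
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for offset in range(3):
            frames[(strand, offset)] = translate(s[offset:])
    return frames

def open_segments(protein, min_len=2):
    """Split a translation at STOP codons; keep pieces of at least min_len residues."""
    return [p for p in protein.split("*") if len(p) >= min_len]

# Hypothetical sequence, used only to show the output format.
for frame, protein in six_frame_translations("ATGGCCTAA").items():
    print(frame, protein, open_segments(protein))
```

The `min_len` default of 2 mirrors the minimum of at least 2 amino acids mentioned above; a larger value can be passed to reflect the stricter thresholds.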

The method designated primer oligo base extension (PROBE) can be used 
herein. This method uses a single detection primer followed by an 
oligonucleotide extension step to give products, which can be readily resolved 
by IR-MALDI mass spectrometry. The products differ in length by a number of 
bases specific for a number of repeat units or for second site mutations within 
the repeated region. The method is advantageously used, for example, for
determining identity, identifying mutations, familial relationship, HLA
compatibility and other such markers using PROBE-MS analysis of
microsatellite DNA. In a preferred embodiment, the method includes the steps
of: 

a) obtaining a biological sample from two individuals; 
b) amplifying a region of DNA from each individual that contains two
or more microsatellite DNA repeat sequences;
c) ionizing/volatizing the amplified DNA;
d) detecting the presence of the amplified DNA and comparing the
molecular weight of the amplified DNA.
Different sizes are indicative of non-identity (i.e., wild-type versus mutation),
non-heredity or non-compatibility; similar size fragments indicate the possibility
of identity, of familial relationship, or of HLA compatibility. More than one
marker can be examined simultaneously; primers with different linker moieties
are used for immobilization.
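
The comparison in step d) can be sketched as follows. The relative tolerance, the example masses and the function name are assumptions for illustration; a practical tolerance would depend on the mass accuracy of the instrument. A match indicates only the possibility of identity or relationship, as stated above.

```python
def fragments_match(masses_a, masses_b, tolerance=0.001):
    """Return True if the two sets of fragment masses (Da) pair up within a
    relative tolerance; a different number of fragments counts as a mismatch."""
    if len(masses_a) != len(masses_b):
        return False
    pairs = zip(sorted(masses_a), sorted(masses_b))
    return all(abs(a - b) <= tolerance * max(a, b) for a, b in pairs)

# Hypothetical molecular weights of amplified microsatellite fragments.
individual_1 = [15000.0, 16250.0]
individual_2 = [16251.0, 15001.0]
individual_3 = [15000.0, 17500.0]

print(fragments_match(individual_1, individual_2))  # similar sizes
print(fragments_match(individual_1, individual_3))  # different sizes
```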

Another method, loop-primer oligo base extension, designated LOOP-PROBE,
for detection of mutations, especially predominant disease-causing
mutations or common polymorphisms, can also be used in the IR-MALDI formats
provided herein. In a particular embodiment, this method for detecting target
nucleic acid in a sample includes the steps of:

a) amplifying a target nucleic acid sequence, such as β-globin, in a
sample, using (i) a first primer whose 5'-end shares identity to a
portion of the target DNA immediately downstream from the
targeted codon followed by a sequence that introduces a unique
restriction endonuclease site, such as CfoI in the case of β-globin,
into the amplicon and whose 3'-end primer is self-complementary;
and (ii) a second downstream primer that contains a tag, such as
biotin, for immobilizing the DNA to a solid support, such as
streptavidin beads;

c) immobilizing the double-stranded amplified DNA to a solid support
via a linker moiety;

d) denaturing the immobilized DNA and isolating the non-immobilized
DNA strand;

e) annealing the intracomplementary sequences in the 3'-end of the
isolated non-immobilized DNA strand, such that the 3'-end is
extendable by a polymerase, which annealing can be performed,
for example, by heating and then cooling to about 37°C, or other
suitable method;

f) extending the annealed DNA by adding DNA polymerase,
3 dNTPs/1 ddNTP, whereby the 3'-end of the DNA strand is
extended by the DNA polymerase to the position of the next
ddNTP location (i.e., to the mutation location);

g) cleaving the extended double stranded stem loop DNA with the
unique restriction endonuclease and removing the cleaved stem
loop DNA;

i) (optionally adding a matrix, particularly a liquid matrix as defined
herein) ionizing/volatizing the extended product; and

j) detecting the presence of the extended target nucleic acid,
whereby the presence of a DNA fragment of a mass different
from wild-type is indicative of a mutation at the target codon(s).
This method eliminates one specific reagent for mutation detection compared
to other methods of MS mutational analyses, thereby simplifying the process and
rendering it amenable to automation. Also, the specific extended product that is 
analyzed is cleaved from the primer and is therefore shorter compared to the 
other methods. In addition, the annealing efficiency is higher compared to 
annealing of an added primer and should therefore generate more product. The 
process is compatible with multiplexing and various detection schemes (e.g.,
single base extension, oligo base extension and sequencing). For example, the 
extension of the loop-primer can be used for generation of short diagnostic 
sequencing ladders within highly polymorphic regions to perform, for example, 
HLA typing or resistance as well as species typing. 
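
The extension step with 3 dNTPs/1 ddNTP can be pictured with a small sketch: the 3'-end is extended base by base along the template until the first position at which the incoming nucleotide is the one supplied only as a ddNTP. The template sequences below are hypothetical, and the function name is an assumption for this illustration.

```python
def extend_until_terminator(template_ahead, dd_base):
    """Return the bases added to the 3'-end when copying template_ahead.

    template_ahead: template bases ahead of the 3'-end, in the order the
                    polymerase reads them.
    dd_base:        the base supplied only as a chain-terminating ddNTP.
    """
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    added = []
    for base in template_ahead.upper():
        incoming = complement[base]
        added.append(incoming)
        if incoming == dd_base:
            break  # ddNTP incorporated, so extension stops here
    return "".join(added)

# Hypothetical wild-type and mutant templates differing at the third position.
wild_type = extend_until_terminator("CGAGTA", "T")
mutant = extend_until_terminator("CGGGTA", "T")
print(wild_type, len(wild_type))
print(mutant, len(mutant))
```

In this example the wild-type template terminates after three added bases and the mutant after six, so the two extension products differ in length and therefore in mass.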

Genotyping and phenotyping

A process for determining the identity of an allelic variant of a 
polymorphic region of a gene, particularly a human gene, also is provided. 
Allelic variants can differ in the identity of a single nucleotide or base pair, for 
example, by substitution of one nucleotide; in two or more nucleotides or base 
pairs; or in the number of nucleotides due, for example, to additions or deletions 
of nucleotides or of trinucleotide repeats; or due to chromosomal 
rearrangements such as translocations. Specific allelic variants of polymorphic 
regions are associated with specific diseases and, in some cases, correlate with 
the prognosis of the disease.

Also provided is a process for determining the genetic nature of a
phenotype or for identifying a predisposition to that phenotype. For example, it 
can be determined whether a subject has a predisposition to a specific disease 
or condition, i.e., whether the subject has, or is at risk of developing, a disease 
or condition associated with a specific allelic variant of a polymorphic region of
a gene. Such a subject can be identified by determining whether the subject 
carries an allelic variant associated with the specific disease or condition. 
Furthermore, if the disease is a recessive disease it can be determined whether 
a subject is a carrier of a recessive allele of a gene associated with the specific 
disease or condition. 

Numerous diseases or conditions have been genetically linked to a 
specific gene and, more particularly, to a specific mutation or genetic lesion of a 
gene. For example, hyperproliferative diseases such as cancers are associated 
with mutations in specific genes. Such cancers include breast cancer, which 
has been linked to mutations in BRCA1 or BRCA2. Mutant alleles of BRCA1 are 
described, for example, in U.S. Patent No. 5,622,829. Other genes such as 
tumor suppressor genes, which are associated with the development of cancer 
when mutated, include, but are not limited to, p53 (associated with many forms 
of cancer); Rb (retinoblastoma); WT1 (Wilms' tumor) and various
proto-oncogenes such as c-myc and c-fos (see Thompson and Thompson,
Genetics in Medicine 5th ed.; Nora et al., Medical Genetics 4th ed. (Lea and
Febiger, eds.)).

A process as disclosed herein also can be used to detect DNA mutations 
that result in the translation of a truncated polypeptide, as occurs, for example, 
with BRCA1 and BRCA2. In one embodiment, translation of nucleic acid 
regions containing such a mutation results in a truncated polypeptide, which 
easily can be differentiated from the corresponding non-truncated polypeptide 
by IR-MALDI mass spectrometry. 

A process as disclosed herein also can be used to genotype a subject, 
for example, a subject being considered as a recipient or a donor of an organ or 
a bone marrow graft. For example, the identity of MHC alleles, particularly HLA 
alleles, in a subject can be determined. The information obtained using such a
method is useful because transplantation of a graft to a recipient having
different transplantation antigens than the graft can result in rejection of the 
graft and can result in graft versus host disease following bone marrow 
transplantation. 

The response of a subject to medicaments can be affected by variations
in drug modification systems such as the cytochrome P450 system, and
susceptibility to particular infectious diseases can be influenced by genetic 
status. Genes involved in pharmacogenetics are well known (Nora et al.,
Medical Genetics 4th ed. (Lea and Febiger, eds.)). Thus, the identification of
particular allelic variants can be used to predict the potential responsiveness of
a subject to a specific drug or the susceptibility of a subject to an infectious
disease. 

Some polymorphic regions may not be related to any disease or 
condition. For example, many loci in the human genome contain a polymorphic
short tandem repeat (STR) region. STR loci contain short, repetitive sequence
elements of 3 to 7 base pairs in length. It is estimated that there are 200,000
expected trimeric and tetrameric STRs, which are present as frequently as once
every 15 kb in the human genome (see, e.g., International Publ. WO 92/13969;
Edwards et al., Nucl. Acids Res. 19:4791 (1991); Beckmann et al., Genomics
12:627-631 (1992)). Nearly half of these STR loci are polymorphic, providing a
rich source of genetic markers. Variation in the number of repeat units at a
particular locus is responsible for the observed polymorphism reminiscent of
variable number tandem repeat (VNTR) loci (Nakamura et al., Science
235:1616-1622 (1987)); and minisatellite loci (Jeffreys et al., Nature
314:67-73 (1985)), which contain longer repeat units, and microsatellite or
dinucleotide repeat loci (Luty et al., Nucl. Acids Res. 19:4308 (1991); Litt et al.,
Nucl. Acids Res. 18:4301 (1990); Litt et al., Nucl. Acids Res. 18:5921 (1990);
Luty et al., Am. J. Hum. Genet. 46:776-783 (1990); Tautz, Nucl. Acids Res.
17:6463-6471 (1989); Weber et al., Am. J. Hum. Genet. 44:388-396 (1989);
Beckmann et al., Genomics 12:627-631 (1992)).

Polymorphic STR loci and other polymorphic regions of genes are 
extremely useful markers for human identification, paternity and maternity
testing, genetic mapping, immigration and inheritance disputes, zygosity testing
in twins, tests for inbreeding in humans, quality control of human cultured cells,
identification of human remains, and testing of semen samples, blood stains and
other material in forensic medicine. Such loci also are useful markers in
commercial animal breeding and pedigree analysis and in commercial plant
breeding. Traits of economic importance in plant crops and animals also can be
identified through linkage analysis using polymorphic DNA markers. 

STR loci can be amplified by PCR using specific primer sequences 
identified in the regions flanking the tandem repeat to be targeted. Allelic forms 
of these loci are differentiated by the number of copies of the repeat sequence
contained within the amplified region. Examples of STR loci include
pentanucleotide repeats in the human CD4 locus (Edwards et al., Nucl. Acids
Res. 19:4791 (1991)); tetranucleotide repeats in the human aromatase
cytochrome P-450 gene (CYP19; Polymeropoulos et al., Nucl. Acids Res.
19:195 (1991)); tetranucleotide repeats in the human coagulation factor XIII A
subunit gene (F13A1; Polymeropoulos et al., Nucl. Acids Res. 19:4306 (1991));
tetranucleotide repeats in the F13B locus (Nishimura et al., Nucl. Acids Res.
20:1167 (1992)); tetranucleotide repeats in the human c-fes/fps proto-oncogene
(FES; Polymeropoulos et al., Nucl. Acids Res. 19:4018 (1991));
tetranucleotide repeats in the LPL gene (Zuliani et al., Nucl. Acids Res. 18:4958
(1990)); trinucleotide repeat polymorphisms at the human pancreatic
phospholipase A-2 gene (PLA2; Polymeropoulos et al., Nucl. Acids Res.
18:7468 (1990)); tetranucleotide repeat polymorphism in the VWF gene (Ploos
et al., Nucl. Acids Res. 18:4957 (1990)); and tetranucleotide repeats in the
human thyroid peroxidase (hTPO) locus (Anker et al., Hum. Mol. Genet. 1:137
(1992)).
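
Because allelic forms of an STR locus differ in the number of copies of the repeat contained within the amplified region, the alleles can be told apart by counting repeat units. The sketch below does this on the sequence level; the motif, flanking sequences and function name are hypothetical and serve only to illustrate the idea.

```python
import re

def max_tandem_repeats(sequence, motif):
    """Return the longest run of back-to-back copies of motif in sequence."""
    pattern = f"(?:{re.escape(motif.upper())})+"
    runs = re.findall(pattern, sequence.upper())
    return max((len(run) // len(motif) for run in runs), default=0)

# Hypothetical amplified regions: identical flanks, different repeat numbers.
flank_5, flank_3, motif = "GGCT", "CCGA", "AATG"
allele_a = flank_5 + motif * 6 + flank_3
allele_b = flank_5 + motif * 9 + flank_3

n_a = max_tandem_repeats(allele_a, motif)
n_b = max_tandem_repeats(allele_b, motif)
print(n_a, n_b, "length difference:", len(allele_b) - len(allele_a), "bases")
```

A difference of three tetranucleotide units corresponds to a 12-base difference in amplicon length, which appears as a mass difference in the spectrum.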

Diagnosis of genetic diseases and infectious diseases 

Depending on the target biological macromolecule to be detected, the
disclosed processes allow the diagnosis, for example, of a genetic disease or
chromosomal abnormality; a predisposition to or an early indication of a
gene-influenced disease or condition such as obesity, atherosclerosis, diabetes or
cancer; or an infection by a pathogenic organism, including a virus, bacterium,
parasite or fungus; or provide information relating to identity or heredity based,
for example, on an analysis of mini-satellites and micro-satellites, or to
histocompatibility based, for example, on HLA phenotyping. Accordingly,
processes are provided for detecting genetic lesions that are characterized, for
example, by an abnormal number of trinucleotide repeats, which can range from
less than 10 to more than 100 additional trinucleotide repeats relative to the
number of repeats, if any, in a gene in a non-affected individual, by using
IR-MALDI mass spectrometry to analyze an encoding target nucleic acid or an
encoded target polypeptide, as disclosed herein.

Diseases associated with genetic lesions characterized by nucleotide
repeats include, for example, Huntington's disease, prostate cancer, SCA-1,
Fragile X syndrome (Kremer et al., Science 252:1711-14 (1991); Fu et al., Cell
67:1047-58 (1991); Hirst et al., J. Med. Genet. 28:824-29 (1991)), myotonic
dystrophy type I (Mahadevan et al., Science 255:1253-55 (1992); Brook et al.,
Cell 68:799-808 (1992)), Kennedy's disease (also termed spinal and bulbar
muscular atrophy; La Spada et al., Nature 352:77-79 (1991)), Machado-Joseph
disease, and dentatorubral and pallidoluysian atrophy. The abnormal number of
triplet repeats can be located in any region of a gene, including a coding region,
a non-coding region of an exon, an intron, or a promoter or other regulatory
element. For example, the expanded trinucleotide repeat associated with
myotonic dystrophy occurs in the 3' untranslated region (UTR) of the MtPK
gene on chromosome 19. In some of these diseases, for example, prostate
cancer, the number of trinucleotide repeats is positively correlated with
prognosis of the disease such that a higher number of trinucleotide repeats
correlates with a poorer prognosis.

Hence, the process for detecting nucleic acids by IR-MALDI mass 
spectrometry can be useful, for example, for diagnosing the existence of any 
one of the more than 3000 known genetic diseases (Cooper and Krawczak, 
"Human Genome Mutations" (BIOS Publ. 1993)), including hemophilias, 
thalassemias, Duchenne muscular dystrophy, Huntington's disease, Alzheimer's
disease and cystic fibrosis, or other genetic disease to be identified. In addition,
the processes can be useful for diagnosing certain birth defects that are the
result of chromosomal abnormalities such as trisomy 21 (Down's syndrome),
trisomy 13 (Patau syndrome), trisomy 18 (Edward's syndrome), monosomy X
(Turner's syndrome) and other sex chromosome aneuploidies such as
Klinefelter's syndrome (XXY). The processes also can be used to detect certain
DNA sequences that may predispose an individual to any of a number of
diseases, including, for example, diabetes, arteriosclerosis, obesity, various
autoimmune diseases and cancers such as colorectal, breast, ovarian, prostate
and lung cancer, or that render an individual suitable or unsuitable for a
particular medical treatment.

Alternatively, the processes can be used to detect nucleic acids that are
characteristic of viruses, bacteria, fungi or other infectious organisms, which
have nucleic acid sequences that are different from the sequences normally
contained in the host cell. The processes also can be used to detect
characteristic nucleic acid sequences that provide information relating to
identity, heredity or compatibility.

Disease-causing viruses that infect humans and animals and that may be
detected by a disclosed process include, but are not limited to, Retroviridae
(e.g., human immunodeficiency viruses such as HIV-1 (also referred to as
HTLV-III, LAV or HTLV-III/LAV; Ratner et al., Nature 313:227-284 (1985); Wain
Hobson et al., Cell 40:9-17 (1985)), HIV-2 (Guyader et al.; European Patent
Publication No. 0 269 520; Chakrabarti et al., Nature 328:543-547 (1987);
European Patent Application No. 0 655 501), and other isolates such as HIV-LP
(International Publ. WO 94/00562)); Picornaviridae (e.g., polioviruses,
hepatitis A virus (Gust et al., Intervirology 20:1-7 (1983)), enteroviruses,
human coxsackie viruses, rhinoviruses, echoviruses); Caliciviridae (e.g.,
strains that cause gastroenteritis); Togaviridae (e.g., equine encephalitis
viruses, rubella viruses); Flaviviridae (e.g., dengue viruses, encephalitis
viruses, yellow fever viruses); Coronaviridae (e.g., coronaviruses);
Rhabdoviridae (e.g., vesicular stomatitis viruses, rabies viruses); Filoviridae
(e.g., ebola viruses); Paramyxoviridae (e.g., parainfluenza viruses, mumps
virus, measles virus, respiratory syncytial virus); Orthomyxoviridae (e.g.,
influenza viruses); Bunyaviridae (e.g., Hantaan viruses, bunya viruses,
phleboviruses and Nairo viruses); Arenaviridae (hemorrhagic fever viruses);
Reoviridae (e.g., reoviruses, orbiviruses and rotaviruses); Birnaviridae;
Hepadnaviridae (Hepatitis B virus); Parvoviridae (parvoviruses);
Papovaviridae (papilloma viruses, polyoma viruses); Adenoviridae (most
adenoviruses); Herpesviridae (herpes simplex virus type 1 (HSV-1) and HSV-2,
varicella zoster virus, cytomegalovirus, herpes viruses); Poxviridae (variola
viruses, vaccinia viruses, pox viruses); Iridoviridae (e.g., African swine fever
virus); and unclassified viruses (e.g., the etiological agents of Spongiform
encephalopathies, the agent of delta hepatitis (thought to be a defective
satellite of hepatitis B virus), the agents of non-A, non-B hepatitis
(class 1 = internally transmitted; class 2 = parenterally transmitted, i.e.,
Hepatitis C); Norwalk and related viruses, and astroviruses).

Examples of infectious bacteria include Helicobacter pylori, Borrelia
burgdorferi, Legionella pneumophila, Mycobacteria sp. (e.g., M. tuberculosis, M.
avium, M. intracellulare, M. kansasii, M. gordonae), Staphylococcus aureus,
Neisseria gonorrhoeae, Neisseria meningitidis, Listeria monocytogenes,
Streptococcus pyogenes (Group A Streptococcus), Streptococcus agalactiae
(Group B Streptococcus), Streptococcus sp. (viridans group), Streptococcus
faecalis, Streptococcus bovis, Streptococcus sp. (anaerobic species),
Streptococcus pneumoniae, pathogenic Campylobacter sp., Enterococcus sp.,
Haemophilus influenzae, Bacillus anthracis, Corynebacterium diphtheriae,
Corynebacterium sp., Erysipelothrix rhusiopathiae, Clostridium perfringens,
Clostridium tetani, Enterobacter aerogenes, Klebsiella pneumoniae, Pasteurella
multocida, Bacteroides sp., Fusobacterium nucleatum, Streptobacillus
moniliformis, Treponema pallidum, Treponema pertenue, Leptospira, and
Actinomyces israelii.

Examples of infectious fungi include but are not limited to Cryptococcus
neoformans, Histoplasma capsulatum, Coccidioides immitis, Blastomyces
dermatitidis, Chlamydia trachomatis, Candida albicans. Other infectious
organisms include protists such as Plasmodium falciparum and Toxoplasma
gondii.

Releasable Mass-Label Molecules

IR-MALDI MS may be used in conjunction with mass-label molecules for
the detection and identification of target molecules. Releasable mass-label 
molecules have been described in PCT Application Publication No. WO 
98/26095, which is incorporated in its entirety by reference herein. In these
methods, the target molecule is linked to a mass-label through an element that
is specific for the target. The target is "indirectly" detected after release of the 
mass-label from the target molecule and detection of the mass-label by IR- 
MALDI MS. The mass value of the label identifies and characterizes the 
element specific for the target. Thus, detection of the mass-label, instead of
the target molecule itself, is indicative of the presence of the target molecule in
a sample. 
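
The indirect detection described above amounts to a lookup from an observed mass value to the target-specific element that carried the label. The following sketch illustrates that lookup; the label masses, target names, tolerance and function name are all hypothetical and are not taken from this disclosure or from WO 98/26095.

```python
def identify_targets(peaks, label_table, tolerance=0.5):
    """Return the targets whose released mass label matches an observed peak.

    peaks:       observed peak masses in daltons.
    label_table: mapping of target name to the mass of its label in daltons.
    tolerance:   assumed absolute mass tolerance in daltons.
    """
    found = []
    for target, label_mass in label_table.items():
        if any(abs(peak - label_mass) <= tolerance for peak in peaks):
            found.append(target)
    return sorted(found)

# Hypothetical label assignments, one distinct label mass per target.
labels = {"target_A": 612.3, "target_B": 745.8, "target_C": 890.1}
observed = [612.5, 890.0]

print(identify_targets(observed, labels))
```

Only the labels are measured, so the targets themselves never need to be desorbed; distinct label masses are what allow several targets to be read from one spectrum.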

Any of the methods of performing IR-MALDI mass spectrometry as 
described herein may be used to detect the mass label. For example, the mass 
label may be mixed with any of the matrices as described herein and subjected 
to IR-MALDI mass spectrometry. In particular embodiments, the mass label is 
mixed with a glycerol matrix prior to performing IR-MALDI mass spectrometry. 

The mass label is contained within a release tag compound which further 
contains one or more reactive groups and one or more release groups. The 
reactive group reacts with the target molecule. The mass label is linked, or 
attached, to the reactive group via a releasable attachment. Typically, the mass 
label is released from all or a part of the reactive group prior to mass spectral 
analysis. This releasable attachment typically occurs through the use of a 
release group which may be the linkage between the mass label and the 
reactive group or which may comprise a portion or all of the reactive group or 
which may be contained within the reactive group. 

Typical target molecules include polynucleotides, gene sequences, 
mutations within a gene or protein sequence, toxins, metals, receptors,
antigens, ligands, polypeptides, carbohydrates and lipids.

The mass label

The mass label (also referred to as a tag) may be any compound that 
may be detected by mass spectrometry and includes synthetic polymers and 
biopolymers. Synthetic polymers include polyethylene glycol, polyvinyl phenol,
polypropylene glycol, polymethyl methacrylate, and derivatives thereof.
Synthetic polymers typically contain monomer units including ethylene glycol, 
vinyl phenol, propylene glycol, methyl methacrylate, and derivatives and 
combinations thereof. Biopolymers include those comprising monomer units 
such as amino acids, non-natural amino acids, peptide mimics, nucleic acids,
nucleic acid mimics and analogs, and saccharides and combinations thereof. In 
certain embodiments, the mass label has a molecular weight greater than about 
500 Daltons. In some embodiments, the mass label may be nonvolatile 
(including involatile), whereas in other embodiments, volatile mass labels may 
be used. Other mass labels include heme groups, dyes, organometallic
compounds, steroids, fullerenes, retinoids, carotenoids and polyaromatic 
hydrocarbons. 

The reactive group 

The reactive group refers to a group capable of reacting with the 
molecule whose presence is to be detected. For example, the reactive group
may be a biomolecule capable of specific molecular recognition. Biomolecules 
capable of specific molecular recognition may typically be any molecule capable 
of specific binding interactions with unique molecules or classes of molecules, 
including but not limited to peptides, polypeptides, proteins and polynucleic 
acids. Polypeptides include peptides comprising two or more native or non-
native amino acid monomers such as native proteins, gene products, protein 
conjugates, mutant or polymorphic polypeptides, post-translationally modified 
proteins, genetically engineered gene products including products of chemical 
synthesis, in vitro translation, cell-based expression systems, including fast 
evolution systems involving vector shuffling, random or directed mutagenesis
and peptide sequence randomization, oligopeptides, antibodies, enzymes, 
receptors, regulatory proteins, nucleic acid-binding proteins, hormones, or 
protein products of a display method such as phage or bacterial display 
methods. 

Nucleic acids include standard or naturally-occurring as well as
modified/non-natural nucleic acids, often known as nucleic acid mimics or
mimetics. Thus, nucleotides refer to both naturally-occurring and
modified/non-naturally occurring nucleotides, including nucleoside tri-, di-, and
monophosphates as well as monophosphate monomers present within
polynucleic acid or oligonucleotide. A nucleotide may be a ribo, 2'-deoxy,
2',3'-deoxy as well as a vast array of other nucleotide mimics that are well
known in the art. Mimics include chain-terminating nucleotides, such as
3'-O-methyl, halogenated base or sugar substitutions, alternative sugar
structures including nonsugar, alkyl ring structures, alternative bases including
inosine, deaza-modified, chi and psi linker-modified, mass label-modified,
phosphodiester modifications or replacements including phosphorothioate,
methylphosphonate, boranophosphate, amide, ester, ether and abasic or
complete internucleotide replacement, including cleavage linkages such as a
photocleavable nitrophenyl moiety. These modifications are well known in the
art and based on
fundamental principles as described in Saenger (1983) Principles of Nucleic Acid 
Structure, Springer-Verlag, NY. 

Polynucleic acids include molecules containing more than one nucleic 
acid. Polynucleic acids include lengths of two or more nucleotide monomers 
and encompass nucleic acids, oligonucleotides, oligos, polynucleotides, DNA, 
genomic DNA, mitochondrial DNA, copy DNA, bacterial DNA, viral DNA, viral 
RNA, RNA, message RNA, transfer RNA, ribosomal RNA, catalytic RNA, clones, 
plasmids, M13, P1, cosmid, bacterial artificial chromosome, yeast artificial 
chromosome, mammalian artificial chromosome, amplified nucleic acid, 
amplicon, PCR product and other types of amplified nucleic acid. 

A reactive group may be an oligonucleotide having one or more 
nucleotides or oligonucleotide(s) added after hybridization of the reactive group 
to a complementary nucleic acid sequence. A nucleotide added after 
hybridization may have a chain-terminating modification, for example, a chain- 
terminating dideoxy nucleotide. The added nucleotide may also contain a 
functional group capable of being immobilized on a solid support, for example, a 
biotin or digoxigenin. Generally, this functional group or binding group or 
moiety is capable of attaching or binding the tag compound to the solid support. 
The binding moiety may be attached to the added nucleotide or oligonucleotide 
directly through an intervening linking group or by specific hybridization to an 
intermediary oligonucleotide which is itself bound to a solid support. Binding 
moieties include functional groups for covalent bonding to a solid support, 
ligands that attach to the solid support via a high-affinity, noncovalent 
interaction (such as biotin with streptavidin), a series of bases complementary 
to an intermediary oligonucleotide which is itself attached to the solid support, 
as well as other means that are well-known to those of skill in the art, such as 
those described elsewhere herein and in PCT Application Publication Nos. 
WO96/37630, WO96/29431, WO98/20019, WO94/16101 and WO98/20166, 
each of which is incorporated in its entirety by reference herein. 

The reactive group may also contain a nuclease blocking moiety which 
serves to block the digestion of the oligonucleotide by the nuclease, such as an 
exonuclease. Typical nuclease blocking moieties include phosphorothioate, 
alkylsilyldiester, boranophosphate, methylphosphonate and peptide nucleic acid. 
The releasable attachment 
The mass label is linked, or attached, to the reactive group via a 
releasable attachment. The release group may be any labile group providing for 
such a releasable attachment. The release group may thus be a chemically 
cleavable linkage or labile chemical linkage. Such linkages may typically be 
cleaved by methods that are well known to those of skill in the art, such as by 
acid, base, oxidation, reduction, heat, light, or metal ion catalyzed, 
displacement or elimination chemistry. For example, the chemically cleavable 
linkage may contain a modified base, a modified sugar, a disulfide bond, a 
chemically cleavable group incorporated into the phosphate backbone, or a 
chemically cleavable linker. Some examples of these linkages are described in 
PCT Application Publication No. WO96/37630. Chemically cleavable groups 
that may be incorporated into the phosphate backbone are well known to those 
of skill in the art and may include dialkoxysilane, 3'-(S)-phosphorothioate, 
5'-(S)-phosphorothioate, 3'-(N)-phosphoroamidate, or 5'-(N)-phosphoroamidate. The 
chemically cleavable linker may be a modified sugar, such as ribose, or the 
linkage may be a disulfide bond. 

When the releasable attachment is contained within the reactive group, 
the release of the releasable attachment may be activated by a selective event. 
For example, the selective release can be mediated by an enzyme such as an 
exonuclease specific for double-stranded or single-stranded DNA. Generally, 
when the releasable attachment is contained within the reactive group, the 
reactive group contains within its structure the particular release group which 
will cause the mass label to disconnect from the tag component. 

The release groups include groups or linkages cleavable by an enzyme. 
Enzymatically cleavable release groups include phosphodiester or amide linkages 
as well as restriction endonuclease recognition sites. Nucleases for cleaving 
release groups include exonucleases and restriction endonucleases. Typical 
exonucleases include exonucleases specific for both double-stranded and 
single-stranded polynucleic acids. Additionally, restriction endonucleases include Type 
IIS and Type II restriction endonucleases. The release group may be cleavable 
by a protease, including endoproteinases. 

Furthermore, the reactive group may contain a nucleoside triphosphate or 
may be synthesized using mass-labeled nucleoside triphosphates. The labeled 
probes may include at least two unique mass labels. 

Exemplary release tag compounds 
Exemplary release tag compounds include those in which the reactive 
group is a double-stranded oligonucleotide containing a restriction endonuclease 
recognition site, the releasable attachment contains a phosphodiester linkage 
capable of being cleaved by a restriction endonuclease and the mass label is one 
detectable by mass spectrometry. The reactive group may further include a 
modified nucleotide and the mass label may include a portion of the reactive 
group. Double-stranded oligonucleotides include not only two complementary 
strands hybridized to each other by hydrogen bonding interactions, but also 
include single strands of nucleotides wherein portions of the strand are single- 
stranded and portions are double-stranded. For example, portions or all of the 
reactive group may include a self-complementary oligonucleotide hairpin where 
part of the reactive group is complementary to another part of the reactive 
group. In this case, certain conditions allow the formation of a double-stranded 
duplex between these two portions of the reactive group. It is not necessary 
that all of the reactive group be double-stranded; release tag compounds 
containing single-stranded regions are also included. 

Further exemplary release tag compounds include those in which the 
reactive group is a double-stranded oligonucleotide, the releasable attachment is 
a chemically cleavable release group and the mass label is one detectable by 
mass spectrometry. In this instance, the releasable attachment is typically 
located within the reactive group. Cleavage at the chemically cleavable release 
group is generally inhibited in this aspect by the presence of a double-stranded 
oligonucleotide at the release group. Chemically cleavable release groups, such 
as 3'-(S)-phosphorothioate, 5'-(S)-phosphorothioate, 3'-(N)-phosphoroamidate, 
5'-(N)-phosphoroamidate or ribose may be employed with these embodiments. 
A portion of the reactive group may be rendered single-stranded at the release 
group by hybridization of a portion of the reactive group to a target nucleic acid. 
A set of release tags (i.e., a group of two or more release tag 
compounds) may also be used for detecting a target nucleic acid. In this 
instance, the target nucleic acid typically contains more than one release tag 
compound. Each release tag compound includes a reactive group, a releasable 
attachment and a mass label. The reactive group may be an oligonucleotide 
including a variable region and an invariant region, the releasable attachment is 
a release group and the mass label is one detectable by mass spectrometry. The 
invariant and variable regions react with the target nucleic acid. Generally, 
each release tag compound of the set will be different from all other members of 
the group. That is, each member will include a different combination of reactive 
group, release group and mass label. Typically, the mass label of at least one 
member of the set may identify a specific sequence within the variable region. 
In some instances, the mass label for each member of the set may uniquely 
identify each different sequence within the variable region. In other instances, a 
combination of the mass labels of two or more release tag compounds may 
identify each different sequence within the variable region. 

Preparation of mass-label probes 
Methods of producing mass-labeled probes include combining nucleoside 
or amino acid monomers with at least one mass-labeled monomer under 
conditions that allow for polymerization. The polymerization may be mediated, 
for example, by an enzyme or by chemical synthesis. Synthetic methods for 
preparing the mass-label probes are essentially those for standard peptide and 
DNA synthesis. 

Methods for detecting target molecules using mass-labeled probes 
Generally, one method for detecting a target molecule includes obtaining 
a plurality of probes, each probe including a reactive group, a release group and 
a mass label as described herein and in PCT Application Publication No. 
WO98/26095. Typically, each probe within the plurality contains a unique 
mass-label. Next, a sample that may or may not contain the target molecule is 
contacted with the plurality of probes under conditions suitable to allow for the 
formation of probe:target molecule complexes. The mass label is released from 
the probe and the mass of the mass-label is determined by IR-MALDI mass 
spectrometry. In a preferred embodiment, the mass label is mixed with a liquid 
matrix in preparation for IR-MALDI mass spectrometry analysis. A particularly 
preferred liquid matrix is glycerol. Typically, the mass is indicative of a specific 
target molecule. In this way, the target molecule can be identified according to 
the unique combination of mass labels. 

In another method for detecting a target molecule, the target molecule is 
amplified, using any method known by one of skill in the art, to produce an 
amplified target molecule. The amplified target molecule is then hybridized with 
a probe such as described herein and in PCT Application Publication No. 
WO98/26095 to produce probe:amplified target molecule complexes. The mass 
label on the amplified target molecule complexes is then released and the mass 
of the mass label is determined by IR-MALDI mass spectrometry analysis. In a 
preferred embodiment, the mass label is mixed with a liquid matrix in 
preparation for IR-MALDI mass spectrometry analysis. A particularly preferred 
liquid matrix is glycerol. The amplified target molecule may also be immobilized 
onto a solid support, and any probe not part of a probe:amplified target 
molecule complex is removed by washing. 

Multiplexing methods are also provided wherein the target molecule is 
contacted with a plurality of probes. Each reactive group of the probe may be 
associated with a unique mass label or it may be associated with a unique set 
of mass labels. Thus a target molecule may be detected by the mass spectral 
detection of a particular mass label or a particular set of mass labels. Mass 
spectral detection is accomplished using IR-MALDI mass spectrometry. In a 
preferred embodiment, the mass label or labels are mixed with a liquid matrix 
prior to performing IR-MALDI mass spectrometry. A particularly preferred liquid 
matrix is glycerol. Where a set of mass labels is employed, the set of mass 
labels may be attached to the same probe. Alternatively, each member of the 
set may be attached to a different probe. 
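
The multiplexed read-out described above can be sketched as a lookup from observed peak masses to probes. The following is only an illustrative sketch: the label masses, target names, peak list and tolerance are all hypothetical values chosen for the example and do not come from this disclosure. A target is reported only when every label in its set is observed.

```python
# Hypothetical table: target name -> set of mass-label masses (Da).
# These values are invented for illustration only.
PROBE_LABELS = {
    "target A": {1200.0},
    "target B": {1350.0, 1500.0},
    "target C": {1650.0, 1800.0},
}

def detect_targets(peaks, probe_labels=PROBE_LABELS, tol=2.0):
    """Return targets whose complete label set is present in the peak list."""
    def seen(label_mass):
        # A label counts as observed if any peak lies within tol Da of it.
        return any(abs(peak - label_mass) <= tol for peak in peaks)

    return sorted(
        target
        for target, labels in probe_labels.items()
        if all(seen(mass) for mass in labels)
    )

observed = [1200.8, 1349.5, 1500.9, 1650.2]  # made-up peak masses
detected = detect_targets(observed)
print(detected)
```

In this made-up peak list, target C is not reported because only one of its two labels is present.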

In another method for detecting a target molecule, the following steps 
are included: (a) obtaining a probe including a reactive group, a release group 
and a nonvolatile mass label; (b) contacting a target molecule with the probe to 
produce probe:target molecule complexes; (c) selectively releasing the mass 
label from the probe:target molecule complexes to produce released mass 
labels; and (d) determining the mass of the released mass labels by IR-MALDI 
mass spectrometry. In a further method, prior to step (d), the mass label is 
mixed with a liquid matrix, preferably glycerol. 

In another method for detecting a target molecule, the following steps 
are included: (a) obtaining a probe including a reactive group, a release group 
and a mass label; (b) contacting a target molecule with the probe to produce 
probe:target molecule complexes; (c) releasing the mass label from the 
probe:target molecule complexes to produce released mass labels; and (d) 
determining the mass of the released mass labels by IR-MALDI mass 
spectrometry. In a further method, prior to step (d), the mass label is mixed 
with a liquid matrix, preferably glycerol. 

A method for multiplexing the detection of a target molecule includes: (a) 
obtaining a plurality of probes, each probe including a reactive group, a release 
group, and a mass label; (b) contacting the target molecule with the plurality of 
probes to produce probe:target molecule complexes; (c) releasing the mass label 
from any probe belonging to the probe:target molecule complexes to produce 
released mass labels; and (d) determining the mass of any released mass label 
by IR-MALDI mass spectrometry. In this respect, each reactive group 
recognizing a specific target molecule is associated with a unique set of mass 
labels. A plurality of target molecules may also be detected with the plurality of 
probes. In a further embodiment, prior to step (d), the mass label is mixed with 
a liquid matrix, preferably glycerol. 

A method for monitoring gene expression includes (a) obtaining a 
plurality of probes, each probe including a reactive group, a release group, and a 
mass label; (b) contacting a plurality of target nucleic acids with the plurality of 
probes to produce probe:target nucleic acid complexes; (c) selectively releasing 
the mass label from any probe belonging to the probe:target nucleic acid 
complex to produce released mass labels; and (d) determining the mass of any 
released mass label by IR-MALDI mass spectrometry. In a further embodiment, 
prior to step (d), the mass label is mixed with a liquid matrix, preferably 
glycerol. 

The target nucleic acids may be amplified prior to step (a). 

A further method for detecting a target molecule includes: (a) amplifying 
one or more target nucleic acids to produce amplified nucleic acid products; (b) 
incorporating one or more molecules including a reactive group, a release group 
and a mass label into the amplified nucleic acid product during the amplification 
process; (c) selectively releasing the mass labels incorporated into the amplified 
nucleic acid products to produce released mass labels; and (d) determining the 
mass of the released mass labels by IR-MALDI mass spectrometry. In a further 
embodiment, prior to step (d), the mass label is mixed with a liquid matrix, 
preferably glycerol. 

Another method for detecting a target molecule includes: (a) obtaining a 
probe comprising a reactive group, a release group and a mass label; (b) 
contacting the probe to a target nucleic acid molecule to produce probe:nucleic 
acid molecule complexes; (c) mass modifying the probe:nucleic acid molecule 
complexes by attaching a nucleotide or oligonucleotide to the probe to produce 
mass modified mass labels; (d) releasing the mass modified labels; and (e) 
determining the mass of the mass-modified labels by IR-MALDI mass 
spectrometry. In a further embodiment, prior to step (e), the mass label is 
mixed with a liquid matrix, preferably glycerol. 

Methods for detecting single nucleotide polymorphisms (SNPs) 
using mass-labeled molecules 
The methods utilizing mass-label molecules can also be used in the 
detection of single nucleotide polymorphisms (SNPs). Mass label probes may be 
prepared that hybridize immediately adjacent to a polymorphic site and a 
polymerase may then be used to add one base at the site of the polymorphism. 
For example, where a single probe is used, a mixture of the four 
chain-terminating triphosphates may be added, each with a unique mass label 
attached. In the homozygous SNP case only one of the four chain-terminating 
nucleotides may add to the end of the probe, coupling the associated mass label 
to the probe. Approaches to releasing the mass label from the probe include, 
but are not limited to, the use of chemically labile functional groups linking the 
mass label to the terminating nucleotide, chemically labile functional groups 
within the backbone of the extended primer or the chain-termination nucleotide, 
or the use of an enzyme to cleave at one or more of the phosphodiester or 
glycosidic linkages within the primer extension product. In cases where the 
mass label release point is within the backbone of the extension product, the 
released mass label may include the terminal nucleotide or some mass-modified 
version thereof. In another version where the release point is internal to the 
primer extension product, the native chain-termination nucleotides themselves 
may serve as all or a portion of the mass labels since each base possesses a 
unique mass. In cases where the mass label is chemically cleaved from the 
probe, any unincorporated nucleotides may first be removed or washed away so 
that they are not visualized by the mass spectrometer. 
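
Because each base has a distinct mass, the incorporated chain terminator can in principle be read from the mass added to the primer. The sketch below assumes unmodified dideoxynucleotides and uses approximate average residue masses that are standard chemistry values supplied here for illustration, not figures from this disclosure; the primer mass, product masses and tolerance are hypothetical.

```python
# Approximate average mass (Da) added to a primer by one incorporated,
# unmodified dideoxynucleotide (ddNMP residue). Illustrative values.
DDNMP_RESIDUE = {"A": 297.21, "C": 273.19, "G": 313.21, "T": 288.20}

def call_base(primer_mass, product_mass, tol=1.0):
    """Return the base whose residue mass best explains the mass shift,
    or None if no base matches within tol Da."""
    shift = product_mass - primer_mass
    best = min(DDNMP_RESIDUE, key=lambda b: abs(DDNMP_RESIDUE[b] - shift))
    return best if abs(DDNMP_RESIDUE[best] - shift) <= tol else None

primer = 5000.0  # hypothetical unextended primer mass
print(call_base(primer, 5313.3))  # shift of about 313.3 Da
print(call_base(primer, 5288.1))  # shift of about 288.1 Da
```

The closest pair in this table, A and T, differs by about 9 Da, which is why mass-modified nucleotides can be useful for widening the spacing between possible products.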

Partitioning of the hybridized mass-labeled chain-terminating triphosphate 
may be done on the basis of mass differences, as labeled triphosphate 
hybridized to a target-hybridized probe will have a higher molecular weight than 
a labeled triphosphate that is not. The probe or target may also be attached to 
a solid-phase via a number of means including biotin/streptavidin or chemical 
coupling or UV cross-linking. A nuclease may also be used to digest the 
mass-labeled probe. Using a nuclease, the mass-labeled chain-terminating nucleotide 
will be released as a monophosphate. The unincorporated mass-labeled 
chain-terminating nucleotides will remain as triphosphates, and the resulting mass 
shift to monophosphate will indicate which nucleotide was incorporated. This 
method relieves the necessity to remove unincorporated nucleotides prior to 
analysis. 
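
The triphosphate-to-monophosphate shift can be estimated from elemental masses: a triphosphate differs from the corresponding monophosphate by two HPO3 units, roughly 160 Da. The sketch below assumes neutral free-acid forms and ignores ionization and adducts; the monophosphate mass, peaks and tolerance are hypothetical values for illustration.

```python
# Average mass of one HPO3 unit (H + P + 3 O), in Da.
HPO3 = 1.008 + 30.974 + 3 * 15.999

def classify_peak(monophosphate_mass, peak, tol=1.0):
    """Classify a peak as the released monophosphate (incorporated),
    the intact triphosphate (unincorporated), or neither."""
    triphosphate_mass = monophosphate_mass + 2 * HPO3
    if abs(peak - monophosphate_mass) <= tol:
        return "incorporated"
    if abs(peak - triphosphate_mass) <= tol:
        return "unincorporated"
    return "unassigned"

label_mono = 800.0  # hypothetical mass of a mass-labeled ddNMP
print(round(2 * HPO3, 2))
print(classify_peak(label_mono, 800.3))
print(classify_peak(label_mono, 959.9))
```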

Many SNPs may be detected simultaneously by multiplexing a large 
number of probes. Mass labels may be present to uniquely tag each of the 
probes that comprise the pool. The addition of a biotinylated chain-terminating 
nucleotide at the site of the point polymorphism may also be used to segregate 
the probe population depending on which probes incorporate a specific 
biotinylated chain-terminating nucleotide and which do not. As an example, the 
pool of mass-labeled probes with target may be divided into four reactions. The 
first reaction would contain only biotinylated dideoxy adenosine triphosphate, 
the second would contain only biotinylated dideoxy cytidine triphosphate, the 
third only biotinylated dideoxy guanosine triphosphate, and the fourth only 
biotinylated dideoxy thymidine triphosphate. Following a single base extension 
polymerase-dependent reaction in the presence of the proper nucleotide, the 
extended products are captured, washed and the mass labels are released for 
analysis by IR-MALDI mass spectrometry. In the first 
reaction, only those mass-labeled probes that incorporated an A will be 
visualized. In the second reaction, only those mass-labeled probes that 
incorporated a C will be visualized. For the third and fourth reactions, probes 
that incorporated, respectively, a G or a T, will be visualized. 
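
The four-reaction scheme can be read out by asking, for each probe's mass label, in which of the A, C, G and T reactions that label appears. The sketch below is illustrative only: the label masses, peak lists and tolerance are hypothetical, and the reported bases are those incorporated by the polymerase.

```python
def incorporated_bases(label_mass, spectra, tol=1.0):
    """Return the bases (in A, C, G, T order) of the reactions whose
    spectrum contains a peak within tol Da of the probe's label mass."""
    return "".join(
        base
        for base in "ACGT"
        if any(abs(peak - label_mass) <= tol for peak in spectra[base])
    )

# Made-up peak lists, one per biotinylated ddNTP reaction.
spectra = {
    "A": [1200.2, 1500.1],
    "C": [1350.0],
    "G": [1500.3],
    "T": [],
}

print(incorporated_bases(1200.0, spectra))  # label seen in one reaction
print(incorporated_bases(1500.0, spectra))  # label seen in two reactions
```

A label that appears in two reactions would be consistent with a heterozygous site under these assumptions.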

Another example of a mass change within a mass label is the case where 
the mass label is present at the 3' end of the probe. Following 
polymerase-dependent base extension, the mass label may be released, including the 
chain-terminating base addition as well as the penultimate base. Placement of 
the mass label and the release site may be at the other bases with a preference 
of placement near the 3' end. In all cases, the mass label should preferably be 
placed between the release group and the 3' end. In other embodiments, it may 
be preferred to perform what is effectively a short chain terminated sequencing 
reaction, where, in addition to dideoxy nucleotides, some amount of normal 
deoxy nucleotides are present. Extension of the primer will result in a nested 
set of products, each being chain terminated by a dideoxynucleotide correlating 
to its complementary base on the template strand. In the preferred form, the 
mass label may be located within the primer near the 3' end which contains a 
chemical release group. Such a method offers a separate embodiment for short 
sequence reads as well as detection of one or more SNPs. All of the SNP 
detection methods may involve the use of mass-modified forms of the different 
nucleotides in order to enhance the mass difference between the different 
possible products. 

SNPs may also be detected by the performance of a discriminating 
exonuclease event in the presence of matching and mismatching oligonucleotide 
probes. One example of this approach is to combine the use of releasable mass 
labels with nick translation PCR. In addition to its polymerase activity, Taq DNA 
polymerase has both 5' to 3' exonuclease and endonuclease activities. If a fully 
complementary oligonucleotide probe is placed in the path of polymerization, for 
example during nucleic acid amplification, the polymerase will attack the 5' end 
of the probe with its exonuclease activity, digesting the molecule until it is too 
small to remain hybridized. However, if the oligonucleotide is not perfectly 
complementary near the 5' end, e.g., a mismatch is present nearby, then the 
end of the probe will fray and be attacked by the endonucleolytic activity of the 
polymerase rather than the exonuclease activity. The nucleolytically cleaved 
product, preferably containing the mass label, will have a different final mass 
depending on whether or not a mismatch was present and how the nuclease cut 
in response to this mismatch. It has been demonstrated that the initiation of 
endonucleolytic activity can be influenced by the presence and placement of a 
mismatch within the hybridization probe. Selective placement of a mass label 
within the oligonucleotide probe relative to the expected mismatch site can be 
used to yield a differential signal depending on whether or not an actual 
mismatch is present. This assay can be extended to the simultaneous detection 
of multiple SNPs. Each of the probes targeting a particular SNP contains one of 
the four possible bases to complement the site of the polymorphism. The 
placement of the mass label is such that if the probe contains a perfect match 
to the template, the mass label will be released by the exonuclease activity of 
the Taq polymerase, primarily in a form that includes a single nucleotide. The 
other probes will create a mismatch and the endonuclease activity of the 
polymerase will initiate cutting of the probe in such a way that includes more 
than one nucleotide. The shift in mass of the mass label cleavage product is 
diagnostic of whether or not a mismatch has occurred. 

Methods for identifying short sequences using mass-labeled 
probes 

The mass-labeled probes may be used to identify short sequences. In 
particular, combinations of hybridization and enzymatic (polymerase or ligase) 
extension can be employed with the labeled probes to identify short sequence 
runs adjacent to a "priming" or anchoring region. There are several methods for 
doing this. In one method, a mixture of probes are synthesized containing two 
domains, a fixed sequence recognition domain, typically containing only one or 
a few sequences, and a randomized domain, comprising the full set (or some 
subset) of all possible sequences. The fixed sequence of the probe is used to 
target hybridization of the probe to a single site within a particular target nucleic 
acid. This target site is typically invariant. The sequence adjacent to the 
invariant sequence is variable and, depending on the particular target, can have 
any one of the total combinations of sequence. In order to probe for all 
possibilities, it is necessary to synthesize probes containing all the possible 
secondary domain sequence combinations. For example, if the second probe 
region is four bases in length, then 256 different probes need to be synthesized. 
The probes can be synthesized individually, each possessing a unique 
combination of mass labels as a releasable mass signature. Alternatively, the 
probes can be synthesized with unique mass signatures using a combinatorial 
synthesis method. 
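
The count of 256 probes for a four-base variable region follows from 4^4, and a combinatorial mass signature can be illustrated by assigning one label per (position, base) pair. The sketch below is only one possible encoding, assumed for illustration: the label masses (a 1000 Da base value with 10 Da steps) are hypothetical and are not taken from this disclosure.

```python
from itertools import product

BASES = "ACGT"

def variable_domains(n):
    """Enumerate every possible sequence of length n (4**n in total)."""
    return ["".join(p) for p in product(BASES, repeat=n)]

def signature(seq, base_mass=1000.0, step=10.0):
    """Hypothetical signature: one label per (position, base) pair,
    so a length-n domain draws on 4*n distinct labels."""
    return tuple(
        base_mass + step * (4 * i + BASES.index(b)) for i, b in enumerate(seq)
    )

probes = variable_domains(4)
signatures = {signature(s): s for s in probes}
print(len(probes), len(signatures))
print(signature("ACGT"))
```

With this encoding, 16 distinct labels suffice to give each of the 256 four-base domains its own combination.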

In order to increase the level of discrimination and extend the read length 
for the short sequence read it is possible to use an enzyme, such as a 
polymerase or ligase, to add a single nucleotide or oligonucleotide to the end of 
the variable region of the anchored probe, optionally including mass labels on 
the added nucleotide or oligonucleotide that can identify the sequence for these 
additions. Addition of bases by either enzyme places stricter requirements on 
the variable region being a perfect hybrid to enable enzymatic action. For 
polymerase the addition needs to be to the 3' end of the probe while ligation 
can occur at either the 3' or 5' end. 

Methods for detecting mismatches using mass-labeled probes 
In one method for detecting mismatches, amplified nucleic acid product 
contains a double-stranded molecule containing a mismatch, and an 
exonuclease-blocking functionality at the 3' ends of the strands. Typically, the 
method may further include cleavage of at least one strand of the 
double-stranded molecule at the site of the mismatch and selective releasing of the 
mass label. Selective releasing of the mass label may typically be accomplished 
by digestion of the cleaved strand by a 3' to 5' exonuclease, such as 
exonuclease III. In selective releasing, a mass label is released from a probe 
which belongs to a probe:target molecule complex without releasing a mass 
label from a probe not belonging to such a complex without having to physically 
partition the two types of probes. The mismatch may be cleaved by an 
enzyme, such as mutHLS, T4 endonuclease VII, mutY DNA glycosylase, 
thymine mismatch DNA glycosylase or endonuclease V. The mismatch may 
also be cleaved by a chemical, such as OsO4, HONH2 or KMnO4. 

Analyzing DNA tandem nucleotide repeat alleles using IR-MALDI mass 
spectrometry 

Analyzing DNA tandem nucleotide repeat alleles 

IR-MALDI mass spectrometry can be used to analyze DNA tandem 
nucleotide repeat alleles and multiplex the identification of more than one DNA 
tandem nucleotide repeat regions from more than one DNA tandem nucleotide 
repeat loci. As noted above, UV mass spectrometric methods 
for such analyses are known to those of skill in the art. These methods are 
modified herein for use with IR-MALDI. 

In one embodiment, a method for analyzing DNA tandem nucleotide 
repeat alleles at a DNA tandem nucleotide repeat locus in a target nucleic acid 
by IR-MALDI is provided. This method includes the steps of extending a target 
nucleic acid using one or more primers to obtain a limited size range of nucleic 
acid extension products, where one or more primers are complementary to a 
sequence flanking the DNA tandem nucleotide repeat of the locus; and 
determining the mass of the nucleic acid extension products by IR-MALDI mass 
spectrometry with a liquid matrix. 
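
Determining a repeat allele from the mass of an extension product can be sketched as a search over repeat counts. The example below assumes an unmodified single-stranded DNA product with 5'-OH and 3'-OH ends and uses approximate average residue masses with the usual end correction; these are general chemistry values supplied for illustration. The flanking sequences, repeat unit, measured mass and tolerance are hypothetical.

```python
# Approximate average in-chain residue masses (Da) for unmodified DNA.
RESIDUE = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}
END_CORRECTION = -61.96  # converts summed residues to a 5'-OH/3'-OH strand

def oligo_mass(seq):
    """Approximate average mass of a single-stranded DNA oligonucleotide."""
    return sum(RESIDUE[b] for b in seq) + END_CORRECTION

def repeat_count(measured, flank5, unit, flank3, max_repeats=60, tol=5.0):
    """Return the repeat number whose predicted mass is closest to the
    measured mass, or None if nothing lies within tol Da."""
    best_n, best_err = None, tol
    for n in range(max_repeats + 1):
        err = abs(oligo_mass(flank5 + unit * n + flank3) - measured)
        if err <= best_err:
            best_n, best_err = n, err
    return best_n

flank5, unit, flank3 = "GATTACAGG", "CA", "TTGCC"  # hypothetical sequences
measured = oligo_mass(flank5 + unit * 12 + flank3) + 0.8  # simulated peak
print(repeat_count(measured, flank5, unit, flank3))
```

Each additional CA unit adds about 602 Da in this sketch, so neighbouring alleles are well separated relative to the assumed tolerance.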

In one embodiment, the 3' end of the one or more primers immediately 
flanks a DNA tandem nucleotide repeat region. In another embodiment, the one 
or more primers include a sequence complementary to one, two or three 
tandem repeats of the DNA tandem nucleotide repeat locus or loci. In another 
embodiment, at least one primer includes a cleavable site. The cleavable site 
preferably includes a recognition site for a restriction endonuclease, an 
exonuclease blocking site, or a chemically cleavable site. In another 
embodiment, at least one primer can be attached to a solid support. Means 
for attachment include biotin or digoxigenin. 

In another embodiment, the extension of at least one primer is 
terminated using a chain termination reagent, such as a dideoxynucleotide 
triphosphate. More than one target nucleic acid can be extended to produce 
more than one nucleic acid extension product, so that a plurality of nucleic 
acids can be analyzed. For example, in one embodiment, the masses of more 
than one DNA tandem nucleotide repeat allele at more than one DNA tandem 
nucleotide repeat locus are determined simultaneously. In another embodiment, 
the DNA tandem nucleotide repeat loci have overlapping allelic mass ranges. In 
another embodiment, the nucleic acid extension products have interleaving 
mass spectral peaks. In another preferred embodiment, at least one nucleic 
acid extension product contains a mass modified nucleotide. 

In yet another preferred embodiment, the length of at least one nucleic 
acid extension product is reduced by cleaving the nucleic acid extension product 
at a cleavable site. More preferably, the cleavable site comprises a restriction 
endonuclease site, an exonuclease blocking site, or a chemically cleavable 
group. 

2 Multiplexing the identification of more than one DNA tandem 
nucleotide repeat regions from more than one DNA tandem 
nucleotide repeat loci 

Methods for multiplexing the identification of more than one DNA 
tandem nucleotide repeat regions from more than one DNA tandem nucleotide 
repeat loci by mass spectrometry are provided. These methods include the steps 
of a) obtaining more than one nucleic acid extension products by extending one or 
more primers complementary to sequences flanking the DNA tandem nucleotide 
repeat regions; and b) determining the mass of the more than one nucleic acid 
extension products simultaneously by IR-MALDI mass spectrometry with a liquid 
matrix, where the nucleic acid extension products have overlapping allelic mass 
ranges. 

In one embodiment, the 3' end of the one or more primers immediately 
flanks a DNA tandem nucleotide repeat region. In another embodiment, the one 
or more primers comprise a sequence complementary to up to one, two or three 
tandem repeats of the DNA tandem nucleotide repeat locus or loci. In another 
embodiment, the extension of at least one primer is terminated using a chain 
termination reagent, such as a dideoxynucleotide triphosphate. 

In yet another embodiment, at least one target nucleic acid extension 
product contains a mass modifying group. More preferably, the mass modifying 
group includes a mass modified nucleotide. Also more preferably, the mass 
modifying group comprises a nonstandard deoxyribonucleotide. In yet another 
embodiment, the cleavable site includes a recognition site for a restriction 
endonuclease, an exonuclease blocking site, or a chemically cleavable site. In 
yet another embodiment, the mass modifying group is incorporated during or 
after extension of the nucleic acid extension product. 

In another embodiment, a method for multiplexing the identification of 
more than one DNA tandem nucleotide repeat regions from more than one DNA 
tandem nucleotide repeat loci by mass spectrometry is provided. This method 
includes the steps of obtaining more than one nucleic acid amplification 
products by amplifying two or more primers complementary to sequences 
flanking the DNA tandem nucleotide repeat regions; and determining the masses 
of more than one nucleic acid amplification products simultaneously by IR- 
MALDI mass spectrometry with a liquid matrix, where the nucleic acid 
extension products have overlapping allelic mass ranges. 

In one such embodiment, the 3' end of the one or more primers 
immediately flanks a DNA tandem nucleotide repeat region. In another 
embodiment, the one or more primers include a sequence complementary to up
to one, two or three tandem repeats of the DNA tandem nucleotide repeat locus
or loci.



In still another embodiment, at least one nucleic acid amplification
product contains a mass modifying group that preferably includes a mass
modified nucleotide, such as a nonstandard deoxyribonucleotide. The mass
modifying group can be incorporated before, during or after amplification. In
yet another embodiment, the cleavable site includes a recognition site for a
restriction endonuclease, an exonuclease blocking site, or a chemically cleavable
site.



3. Detecting mutations in a target nucleic acid using IR-MALDI
mass spectrometry

As noted herein, IR-MALDI mass spectrometry can be used to detect
mutations in a target nucleic acid. In one embodiment, a method for detecting
mutations in a target nucleic acid is provided. The method includes obtaining
from the target nucleic acid a set of nonrandom length fragments (NLFs) in
single-stranded form, where the set includes NLFs derived from one of either
the positive or the negative strand of the target nucleic acid or the set is a
subset of single-stranded NLFs derived from the positive and the negative
strand of the target nucleic acid; and then determining masses of the members
of the set by IR-MALDI mass spectrometry with a liquid matrix.
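
The principle of locating a mutation by comparing NLF masses can be sketched as follows. This Python fragment is a simplified illustration, not the method of this disclosure: the sequences, the cleavage positions and the 2 Da tolerance are hypothetical, the residue masses are standard reference values, the mutation is assumed not to change the cleavage positions, and the same end-group correction is applied to every fragment regardless of the actual cleavage chemistry.

```python
# Average masses (Da) of deoxynucleotide residues; standard reference values.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}
END_CORRECTION = -61.96  # 5'-OH / 3'-OH termini assumed for every fragment


def strand_mass(seq):
    """Approximate average mass (Da) of a single-stranded DNA fragment."""
    return sum(RESIDUE_MASS[base] for base in seq) + END_CORRECTION


def fragment(seq, cut_positions):
    """Split seq at the given 0-based cut positions (hypothetical cleavage sites)."""
    bounds = [0] + sorted(cut_positions) + [len(seq)]
    return [seq[i:j] for i, j in zip(bounds, bounds[1:])]


def shifted_fragments(wild_type, target, cut_positions, tolerance_da=2.0):
    """Return (fragment index, mass shift in Da) for fragments that differ."""
    shifts = []
    pairs = zip(fragment(wild_type, cut_positions), fragment(target, cut_positions))
    for index, (wt_frag, target_frag) in enumerate(pairs):
        delta = strand_mass(target_frag) - strand_mass(wt_frag)
        if abs(delta) > tolerance_da:
            shifts.append((index, round(delta, 2)))
    return shifts


wild_type = "ACGTTGCAAGCTTGGATCCATGCAAGTCCGTA"
# Hypothetical single-base substitution (G to A) at position 14.
target = wild_type[:14] + "A" + wild_type[15:]
print(shifted_fragments(wild_type, target, cut_positions=[10, 20]))
```

Only the fragment containing the substitution shows a mass shift, here the difference between an A and a G residue, which narrows the location of the mutation to that fragment.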

In one embodiment, at least one member of the set of single-stranded
NLFs optionally has one or more nucleotides replaced with mass-modified
nucleotides. In another embodiment, the determining step optionally further
includes using internal self-calibrants to provide improved mass accuracy. In
another embodiment, the target nucleic acid is single-stranded and the obtaining
step includes hybridizing the single-stranded target nucleic acid to one or more 
sets of fragmenting probes to form hybrid target nucleic acid/fragmenting probe 
complexes. The complexes contain at least one double-stranded region and at 
least one single-stranded region. The target nucleic acid molecule is then 
nonrandomly fragmented by cleaving the hybrid target nucleic acid/fragmenting 
probe complexes at every single-stranded region with at least one single-strand- 
specific cleaving reagent to form a set of NLFs. Preferably, the set of 
fragmenting probes leaves single-stranded gaps between double-stranded 
regions formed by hybridization of the set of fragmenting probes to the target 
nucleic acid. The hybridizing step can further include providing two sets of
single-stranded target nucleic acid and separately hybridizing a first set of 
fragmenting probes to a first set of single-stranded target nucleic acid and a 
second set of fragmenting probes to a second set of single-stranded target 
nucleic acid, where the members of the second set of fragmenting probes 
include at least one single-stranded nucleotide sequence complementary to 
regions of the target nucleic acid that are not complementary to any nucleotide 
sequences in any members of the first set of fragmenting probes. Further, the 
members of the first set of fragmenting probes can include sequences of 
nucleotides that overlap with sequences of nucleotides of the members of the
second set of fragmenting probes. 

In yet another embodiment, the single-strand-specific cleaving reagent is
a single-strand-specific endonuclease or a single-strand-specific chemical
cleaving reagent such as, but not limited to, hydroxylamine, hydrogen peroxide,
osmium tetroxide or potassium permanganate.

In another embodiment, where the target nucleic acid is single-stranded, 
a further step after the nonrandomly fragmenting step is included. This step 
involves hybridizing one or more of the NLFs to one or more capture probes, 
where the capture probes contain a single-stranded region complementary to at 
least one of the NLFs and a first binding moiety, binding the first binding moiety 
to a second binding moiety attached to a solid support, where the binding
occurs either before or after the hybridizing of the NLFs to one or more capture
probes, and isolating a set of single-stranded NLFs. 

In another embodiment, where the target nucleic acid is single-stranded 
and fragmenting probes are used, the fragmenting probes include a
single-stranded portion and a first binding moiety. The method further involves, after
the nonrandomly fragmenting step, binding the first binding moiety to a second 
binding moiety attached to a solid support, and isolating the set of single- 
stranded NLFs. 

In another embodiment, the obtaining step further includes nonrandomly 
fragmenting the target nucleic acid with one or more restriction endonucleases
to form a set of NLFs; hybridizing one or more of the set of NLFs or a subset 
thereof to one or more oligonucleotide probes, where each of the 
oligonucleotide probes includes a nucleic acid comprising a single-stranded 
region and a first binding moiety, binding the first binding moiety to a second 
binding moiety attached to a solid support either before or after the hybridizing 
step; and isolating the set or subset of single-stranded NLFs. Preferably, all of 
the oligonucleotide probes include one of either full-length positive or full-length 
negative single strands of the target nucleic acid and a first binding moiety; or 
the binding between the first binding moiety and the second binding moiety is a 
covalent attachment; or one binding moiety is an antibody, a hormone, an
inhibitor, a co-factor portion, a binding ligand, or a polynucleotide sequence,
and the other binding moiety is a corresponding member of an antigen capable 
of recognizing the antibody, a receptor capable of recognizing the hormone, an 
enzyme capable of recognizing the inhibitor, a cofactor enzyme binding site 
capable of recognizing the co-factor portion, a substrate capable of recognizing 
the binding ligand, or a complementary polynucleotide sequence; or the isolating 
further comprises washing the set of NLFs bound to the solid support with a 
composition comprising volatile salts such as ammonium bicarbonate, dimethyl 
ammonium bicarbonate and trimethyl ammonium bicarbonate. 

In another embodiment, where the target nucleic acid is single-stranded, 
the obtaining step further includes hybridizing the single-stranded target nucleic 
acid to one or more restriction site probes to form hybridized target nucleic
acids having double-stranded regions where the restriction site probes have
hybridized to the single-stranded target nucleic acid and at least one
single-stranded region, and nonrandomly fragmenting the hybridized target nucleic
acids using one or more restriction endonucleases that cleave at restriction
sites within the double-stranded regions. A further step after the nonrandomly
fragmenting step can be included. This further step includes hybridizing the
NLFs to one or more capture probes, where the capture probes comprise a
single-stranded region complementary to at least one of the NLFs and a first
binding moiety, binding the first binding moiety to a second binding moiety
attached to a solid support, wherein the binding occurs either before or after the
hybridizing of the NLFs to one or more capture probes, and isolating a set of
single-stranded NLFs. Also, preferably, the cleaved restriction site probes include a
single-stranded region complementary to half of a restriction endonuclease site
and a first binding moiety, and the method further includes, after the
nonrandomly fragmenting step, binding the first binding moiety to a second
binding moiety attached to a solid support, and isolating a set of single-stranded
NLFs.

In another embodiment, where the target nucleic acid is single-stranded, 
the obtaining step further includes performing the method under conditions 
permitting folding of the single-stranded target nucleic acid to form a
three-dimensional structure having intramolecular secondary and tertiary interactions;
nonrandomly fragmenting the folded target nucleic acid with at least one
structure-specific endonuclease to form a set of single-stranded NLFs; modifying
either the target nucleic acid or the set of single-stranded NLFs such that
members of the set of single-stranded NLFs include a single-stranded nucleotide
sequence and at least one first binding moiety; binding the first binding moiety 
to a second binding moiety attached to a solid support; and isolating the set of 
single-stranded NLFs. 

In another embodiment, where the target nucleic acid is single-stranded, 
the obtaining step also can include providing conditions permitting folding of the
single-stranded target nucleic acid to form a three-dimensional structure having
intramolecular secondary and tertiary interactions; nonrandomly fragmenting the
folded target nucleic acid with at least one structure-specific endonuclease to
form a set of single-stranded NLFs; hybridizing one or more of the set of NLFs
to one or more capture probes, where the capture probes contain a
single-stranded nucleotide sequence and a first binding moiety; binding the first
binding moiety to a second binding moiety attached to a solid support either
before or after the hybridizing step; and isolating a set of single-stranded NLFs.
More preferably, the isolated set of single-stranded NLFs includes any NLFs
having a 5' and/or 3' end of the target nucleic acid. Also preferably, the
structure-specific endonuclease is T4 endonuclease VII, RuvC, MutY, or the
endonucleolytic activity from the 5'-3' exonuclease subunit of thermostable
polymerases.

In another embodiment, where the target nucleic acid is single-stranded, 
the obtaining step further includes hybridizing the single-stranded target nucleic 
acid to one or more wild type probes, and nonrandomly fragmenting the target
nucleic acid with one or more mutation-specific cleaving reagents that
specifically cleave at any regions of nucleotide mismatch that form between the
target nucleic acid and any of the wild type probes. More preferably, the
nonrandomly fragmenting step further includes digesting the first set of
nonrandom length fragments with one or more restriction endonucleases or
cleaving the first set of nonrandom length fragments with one or more
single-strand-specific cleaving reagents. Also preferably, members of the set of
single-stranded NLFs comprise a single-stranded region and at least one first binding
moiety; and the method includes, after the nonrandomly fragmenting step, 
binding the first binding moiety to a second binding moiety attached to a solid 
support; and isolating a set of single-stranded NLFs. Further, the obtaining step 
can further include hybridizing members of the set of NLFs to one or more 
capture probes, where the capture probes include a single-stranded portion and 
at least one first binding moiety; and the method includes binding the first 
binding moiety to a second binding moiety attached to a solid support, and 
isolating a set of single-stranded NLFs. The obtaining step can further include 
isolating a set of single-stranded NLFs containing any NLFs that have a 5' end 
of the target nucleic acid.

In another embodiment, a method for detecting mutations in a target
nucleic acid is provided. The method includes the steps of nonrandomly
fragmenting, preferably in a restriction buffer containing volatile salts, the target
nucleic acid with one or more restriction endonucleases to form a set of
double-stranded NLFs; and determining masses of the members of the set of
double-stranded NLFs by IR-MALDI mass spectrometry with a liquid matrix.

In still another specific embodiment, a method for detecting mutations in
a target nucleic acid is provided that includes the steps of nonrandomly
fragmenting the target nucleic acid using one or more restriction endonucleases
to form a first set of nonrandom length fragments (NLFs); hybridizing members
of the first set of NLFs to a set of wild type probes; nonrandomly fragmenting
one or more members of the set of NLFs with one or more mutation-specific
cleaving reagents that specifically cleave at any regions of nucleotide mismatch
that form between members of the first set of NLFs and complementary
members of the set of wild type probes, where the nonrandomly fragmenting
step forms a second set of NLFs; and determining masses of members of the
second set of NLFs using IR-MALDI mass spectrometry with a liquid matrix.
More preferably, the set of wild type probes is obtained by nonrandomly
fragmenting a wild type target nucleic acid using the same
restriction endonucleases used to form the first set of NLFs. More preferably,
the steps of nonrandomly fragmenting the target nucleic acid and obtaining
the set of wild type fragmenting probes are performed simultaneously in a single
composition. Also more preferably, a further step before the determining step is
included. This further step includes isolating the second set of NLFs, where the
members of the second set include a double-stranded region and a first binding
moiety; and binding the first binding moiety to a second binding moiety
attached to a solid support. A further step before the determining step can be
included. This step includes isolating the second set of NLFs by hybridizing
members of the second set of NLFs to one or more capture probes, where the
capture probes include a single-stranded region and a first binding moiety; and
binding the first binding moiety to a second binding moiety attached to a solid
support.

In another embodiment, a method for detecting mutations in a target
nucleic acid is provided that includes nonrandomly fragmenting the target
nucleic acid in a composition containing one or more volatile salts to form a set
of nonrandom length fragments (NLFs); and determining masses of members of
the set of NLFs using IR-MALDI mass spectrometry with a liquid matrix.

In another embodiment, a method for decreasing background noise is 
provided. In this method, the sample is washed with a composition of volatile 
salts, which is then evaporated from the sample. 

ANALYSIS OF DOUBLE-STRANDED NUCLEIC ACID USING IR-MALDI

IR-MALDI is advantageously used for analysis of double-stranded nucleic
acids. It is shown herein that, for analysis of longer fragments, the liquid matrix
should include a salt, such as salts of amines, including ammonium salts or
other salts that are compatible with mass spectrometry analysis of nucleic acids
(see, e.g., Nordhoff et al., Mass Spectrom. Rev. 1996, 15, 67-138), to raise the
ionic strength.

As exemplified herein (EXAMPLES), double stranded DNA molecules
ranging from 9 kDa to over 500 kDa were desorbed and analyzed by MALDI
TOF mass spectrometry. IR-MALDI with glycerol as matrix yielded excellent
results for larger double stranded DNA by adjusting the ionic strength through
the addition of salts. Very little fragmentation and a routine sensitivity in the
sub-picomole range were observed in IR-MALDI when double stranded analytes
harboring 70 base pairs or more were probed. In the lower mass range (up to
approx. 70 bp), UV-MALDI with 6-aza-2-thiothymine (ATT) as matrix was used.
Essentially quantitative detection of the double stranded form was observed for
a 70-mer. With larger fragments, UV-MALDI, however, was accompanied by
significant fragmentation and a resulting reduced sensitivity and mass
resolution.

Automated Analyses 

The methods described herein may be used as part of automated 
processes. For example, U.S. application Serial No. 09/285,481 provides a
fully automated modular analytical system that integrates sample preparation,
instrumentation, and analysis of biopolymer samples. The system integrates
analytical methods of detection and analysis, e.g., mass spectrometry,
radiolabeling, mass tags, chemical tags, fluorescence, chemiluminescence, and
other such labeling moieties, with robotic technology and automated chemical
reaction systems to provide a high-throughput, accurate automated process line.
The instrumentation and processes described herein may be performed and
integrated into the automated process line or into any automated analysis
system or protocol.

A fully automated modular analytical system integrates instrumentation
to permit analysis of biopolymer samples. The samples include, but are not
limited to, all biopolymers, e.g., nucleic acids, proteins, peptides and
carbohydrates. The system integrates analytical methods of detection and
analysis, e.g., mass spectrometry, radiolabeling, mass tags, chemical tags,
fluorescence and chemiluminescence, with robotic technology and automated
chemical reaction systems to provide a high-throughput, accurate Automated
Process Line (APL).
KITS 

Also provided herein are kits for performing IR-MALDI with a liquid 
matrix. The kits include a liquid and a support and optionally instructions for 
performing IR-MALDI as well as particular controls. The kits can also contain a 
support that comprises an array that includes at least one target biological
macromolecule immobilized at a defined position on the array on a support. 

The following examples are included for illustrative purposes only and are 
not intended to limit the scope of the invention. 
EXAMPLE 1

MALDI MASS SPECTROMETRIC ANALYSIS OF NUCLEIC ACID MOLECULES 
CONTAINING 70 to 2180 NUCLEOTIDES 

This example demonstrates that incorporation of nucleic acid molecules 
in a liquid matrix allows accurate mass determination of the nucleic acid 
molecules.
A. Materials and Methods
1. Samples 

Synthetic oligodeoxynucleotides were obtained from Pharmacia Biotech
(Uppsala, Sweden). The 70-mer was FPLC-purified by the supplier; smaller
oligonucleotides were used without additional purification. Plasmid DNA was
purified from E. coli strain DH5α by use of the Qiagen midiprep kit (QIAGEN
GmbH; Hilden, Germany) according to the manufacturer's recommendations.
Restriction enzymes were obtained from New England Biolabs GmbH
(Schwalbach/Taunus, Germany); restriction enzyme digests of plasmid DNA
were performed according to the supplier's protocols. Samples intended for
MALDI mass spectrometry analysis were adjusted to 10 mM EDTA and
2 M NH4 acetate, and precipitated with 2 volumes of ethanol. The pellet was
washed once with 70% ethanol and dissolved in water to an approximate
concentration of 0.5 pmol/µl.

The 1206 nucleotide in vitro transcript was synthesized and ethanol
precipitated according to standard procedures (Kirpekar et al., Nucl. Acids Res.
22:3866-3870 (1994)), using the ScaI-digested plasmid pBluescript KS+ as
template for the T3 RNA polymerase (MBI Fermentas; Vilnius, Lithuania). A
Poros 50 R2 (PerSeptive Biosystems; Framingham, MA) reverse phase
column was prepared and equilibrated with 3% acetonitrile/10 mM triethyl
ammonium acetate (TEAA) as described elsewhere (Kussman et al., J. Mass
Spectrom. 32:593-601 (1997)). The RNA sample was adjusted to
0.3 M TEAA and loaded onto the column. The column was washed with 200 µl
3% acetonitrile/10 mM TEAA, and the sample was eluted with
10 µl 25% acetonitrile/10 mM TEAA.

Subsequent to lyophilization, the eluate was dissolved in 5 µl water; the
estimated sample concentration was 1 pmol/µl. A crude DNA preparation from
mycoplasma-infected HeLa cells was made, and PCR was performed essentially
as described (Hopert et al., J. Immunol. Meth. 164:91-100 (1993)) using the
primers 5'-CGC CTG AGT AGT ACG TTC GC-3' (SEQ ID NO: 1) and
5'-GCG GTG TGT ACA AGA CCC GA-3' (SEQ ID NO: 2), and recombinant Taq
DNA polymerase (MBI Fermentas). The PCR results in an approximately 515 bp
DNA fragment originating from the 16S rRNA gene of mycoplasma (Hopert et
al., J. Immunol. Meth. 164:91-100 (1993)); however, the precise length of the
PCR product cannot be predicted because the species of the mycoplasma is
unknown.

A reamplification by PCR was performed under identical conditions using
the same primer set, and the final product was adjusted to 4 mM EDTA/
2 M NH4 acetate, and precipitated as described for the restriction enzyme
digests. The pellet was dissolved in 200 µl water and purified over a
Microcon-100 (Amicon GmbH; Witten, Germany) microconcentrator by three
successive diafiltrations with 100 µl water as recommended by the
manufacturer. The retentate was lyophilized and redissolved in water to a
concentration of 0.6 pmol/µl as determined by UV spectrophotometry.
2. Sample Preparation 

For IR-MALDI, glycerol was used as the matrix. The glycerol was
incubated with an equal volume of a H+ cation exchange bead suspension
(Dowex 50W-X8; Biorad AG; Munich, Germany) in order to reduce subsequent
alkali salt formation of the nucleic acid backbone phosphates. Typically,
0.5 to 1 µl of glycerol was mixed with an equal amount of an aqueous analyte
composition on the target to give a final analyte-to-glycerol molar ratio of the
sample of about 10 * to 10⁻⁷, depending on the mass of the analyte. The
mixture was smeared evenly over an area of about 1 to 2 mm² to form a
homogeneous, transparent thin layer on the stainless steel substrate. The water 
was evaporated off at a pressure of about 10 2 -1 Pa before the sample was 
introduced into the mass spectrometer. 
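
The analyte-to-glycerol molar ratio follows from the loaded amount of analyte and the volume of glycerol. The Python sketch below is an order-of-magnitude illustration under stated assumptions: the glycerol density and molar mass are standard reference values that do not come from this disclosure, and the example inputs (1 µl of a 0.5 pmol/µl analyte composition mixed with 1 µl of glycerol) are chosen from the ranges mentioned above.

```python
GLYCEROL_DENSITY_G_PER_ML = 1.26        # approximate room-temperature value (assumption)
GLYCEROL_MOLAR_MASS_G_PER_MOL = 92.09   # C3H8O3 (assumption, standard value)


def glycerol_moles(volume_ul):
    """Moles of glycerol contained in a volume given in microliters."""
    grams = GLYCEROL_DENSITY_G_PER_ML * volume_ul * 1e-3  # 1 ul = 1e-3 ml
    return grams / GLYCEROL_MOLAR_MASS_G_PER_MOL


def analyte_to_matrix_ratio(analyte_pmol_per_ul, analyte_volume_ul, glycerol_volume_ul):
    """Molar ratio of analyte to glycerol after on-target mixing."""
    analyte_moles = analyte_pmol_per_ul * analyte_volume_ul * 1e-12
    return analyte_moles / glycerol_moles(glycerol_volume_ul)


ratio = analyte_to_matrix_ratio(0.5, 1.0, 1.0)
print(f"analyte-to-glycerol molar ratio: {ratio:.1e}")
```

Because 1 µl of glycerol contains roughly 14 µmol of matrix, picomole to femtomole loads of analyte correspond to very small molar ratios.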

Samples for UV-MALDI mass spectrometry were prepared by on-target
mixing of 1 µl of a 10⁻⁵ to 10⁻⁶ M aqueous analyte composition with 0.7 µl of a
50 g/l 3-hydroxypicolinic acid (3HPA) composition in 20% acetonitrile. About
ten ammonium-loaded cation exchange beads were added to the samples before
drying in a cool stream of air (Nordhoff et al., Rapid Commun. Mass Spectrom.
6:771-776 (1992)).
3. Instrumental

Experiments were performed using an in-house built MALDI single stage
reflectron time-of-flight (refTOF) mass spectrometer of 3.5 m equivalent flight
length (Berkenkamp et al., Rapid Commun. Mass Spectrom. 11:1399-1406
(1997)). The mass spectrometer also can be used in the linear TOF (linTOF)
mode. Unless specifically mentioned, the experiments were carried out in
reflectron and positive ion mode.

Ions are accelerated through a total potential difference of about
16-25 kV in the split extraction source using either static or delayed ion
extraction (DE). A Venetian blind secondary electron multiplier (EMI 9643) with
a conversion dynode, mounted 10 mm in front of the cathode (ion impact
energy of about 20 to 40 kV, depending on ion mass) or a Chevron
Micro-Channel plate (Galileo Co.; Sturbridge, MA) is used for ion detection.
For high mass ions, the potential between the conversion dynode and the
electron multiplier cathode is set to several thousand volts in order to increase
the ion signal by making efficient use of the secondary ions. If maximum mass
resolution is sought in the mass range up to several thousand Daltons, the
potential between the two electrodes is kept below about 500 V in order to
detect secondary electrons only and thereby avoid the time (and mass)
dispersion of the secondary ions (see, for example, Figure 2A). Signals are
processed by a transient recorder with a time resolution of about 0.5 ns (LeCroy
9350). The digitized data are transferred to a PC for storage and further
evaluation.
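
The relation between acceleration voltage, flight length, flight time and recorder time resolution can be made concrete with an idealized calculation. The Python sketch below is not a model of the instrument described here: it assumes a singly charged ion that receives the full acceleration energy and then drifts field-free over the 3.5 m equivalent flight length, and it picks 20 kV from the stated 16-25 kV range. The physical constants are standard values.

```python
import math

ELEMENTARY_CHARGE_C = 1.602176634e-19
DALTON_KG = 1.66053906660e-27


def flight_time_s(mass_da, accel_voltage_v, flight_length_m, charge=1):
    """Idealized flight time: full acceleration followed by field-free drift."""
    kinetic_energy_j = charge * ELEMENTARY_CHARGE_C * accel_voltage_v
    velocity = math.sqrt(2.0 * kinetic_energy_j / (mass_da * DALTON_KG))
    return flight_length_m / velocity


def recorder_limited_resolution(flight_time, time_bin_s):
    """Upper bound on m/dm set by the time bin alone; m ~ t^2 gives m/dm = t/(2 dt)."""
    return flight_time / (2.0 * time_bin_s)


# 21-mer (6398 Da), 20 kV (assumed), 3.5 m flight length, 0.5 ns time resolution.
t_21mer = flight_time_s(6398.0, 20_000.0, 3.5)
print(f"flight time: {t_21mer * 1e6:.1f} us")
print(f"recorder-limited m/dm: {recorder_limited_resolution(t_21mer, 0.5e-9):.0f}")
```

Under these assumptions the recorder's time resolution permits a far higher resolution than is observed in practice, consistent with the initial energy spread, metastable decay and the detector limiting the resolution rather than the digitizer.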

For IR-MALDI experiments, an Er-YAG laser emitting at 2.94 µm
(Spectrum GmbH; Berlin, Germany; τ = 80-90 ns, energy stability ca. ±2-4%
from shot to shot) was used. A frequency tripled Nd-YAG laser, emitting in the
UV at 355 nm (τ = 6 ns), was used for direct comparison between IR-MALDI and
UV-MALDI. Single laser pulses are focused to a spot diameter of approximately
150 µm (IR) or 100 µm (UV) on the sample under an angle of 45°. Samples are
observed in situ with a CCD camera having about 5 µm resolution.
B. Results

UV-MALDI spectra of DNA having at least about 50 nucleotides and with
a reasonable quality could be obtained only in the linTOF, DE mode. Figures 1A
and 1B demonstrate the striking differences in spectra quality for the two
modes of operation for a synthetic DNA 70-mer (approx. 21.5 kDa) and a 3HPA
matrix (355 nm). The quality of the spectrum of Figure 1B, obtained in
reflectron mode, is quite inferior to that of Figure 1A in several respects. Signal
intensity and signal-to-noise ratio are considerably degraded, as is the mass
resolution, down to 15 (M/Δm; FWHM) from 65 in the spectrum of Figure 1A.
The saturated signal in the mass range below approximately 2000 Da in Figure
1B reflects the increased laser fluence necessary to obtain analyte signals of the
intensity shown. The loss in mass resolution is, for the most part, a result of
the sloping low mass edge of the peaks, signaling abundant metastable small
neutral losses. Exact mass determination is severely compromised by the loss
of spectral quality.
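
The practical meaning of these resolution values is easier to see as peak widths. The short Python sketch below simply applies the definition of resolution, M/Δm at full width at half maximum, to the approximate 70-mer mass and the two resolution values quoted above; the function name is illustrative only.

```python
def peak_width_da(mass_da, resolution):
    """FWHM peak width (Da) implied by a mass resolution M / delta-m."""
    return mass_da / resolution


for label, resolution in (("linear mode, Figure 1A", 65), ("reflectron mode, Figure 1B", 15)):
    width = peak_width_da(21_500.0, resolution)
    print(f"{label}: about {round(width)} Da wide")
```

A peak more than a kilodalton wide at 21.5 kDa cannot support an exact mass determination, which is the point made in the text.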

The IR-MALDI spectrum (refTOF, DE mode) of the same DNA 70-mer 
with glycerol as matrix is shown in Figure 1C. The quality of this spectrum is 
comparable to UV-MALDI analysis obtained in the linear mode with respect to 
signal intensity and mass resolution (Figure 1A). The base peak has a steeply 
rising low mass edge, demonstrating an essential absence of any metastable 
small neutral loss. This behavior was consistently observed for IR-MALDI of
nucleic acid with glycerol as a matrix, qualifying it as a very gentle desorption
method forming ions of nucleic acids of high ion stability. This contrasts
strikingly to the IR-MALDI spectra of nucleic acids obtained with succinic acid
as matrix (see Nordhoff et al., Nucl. Acids Res. 21:3347-3357 (1993), Figures
1d and 1e). The absence of literally all metastable neutral loss for the glycerol
matrix was, therefore, a very unexpected result not anticipated based on prior
experience.

Analysis of nucleic acids by IR-MALDI mass spectrometry is useful for a 
broad mass range of nucleic acids, from small oligonucleotides to molecules 
having up to more than 2000 nucleotides (see Figure 2). A refTOF IR-MALDI 
mass spectrum of a synthetic 21-mer DNA is shown in Figure 2A. With delayed
ion extraction, a mass resolution of 1050 (FWHM) was obtained, comparable to
the resolution obtained with the instrument for proteins in this mass range.
Several poorly resolved peaks on the high mass side of the analyte peak that
appeared in the spectrum are detection artifacts of residual secondary ions
generated at the conversion dynode, operated here in a mode to preferentially
detect only secondary electrons in order to not degrade mass resolution by the
ion detection system.

Figure 2B demonstrates the high mass range with a restriction enzyme 
digest of a plasmid (pBluescript-KS+ digested with BglI and RsaI), yielding four
fragments of 280 bp, 360 bp, 920 bp and 1,400 bp. All four signals represent 
single strands and are the composite signal of the two complementary strands. 
Very weak, if any, signals of the double stranded oligomers are apparent in this 
spectrum. The dissociation of the double strands in samples prepared with 
purified glycerol tentatively is attributed to an acidification by the H+ ion 
exchange resin. The mass resolution of all high mass ion signals is about 50 
(FWHM) and appears to be relatively independent of the ion mass. 

The IR-MALDI mass spectrum of Figure 2C shows the upper mass limit
measured so far for a restriction enzyme digest (130 bp, 640 bp and 2,180 bp).
The signal of the 2,180 nucleotide single stranded fragment was obtained only
after heating the restriction digest to a temperature of 95 °C for 5 minutes,
apparently because such large DNA fragments do not get separated into single
strands under the conditions used, in contrast to the DNA fragments up to
1400 bp. The relatively poor mass resolution of about 30 for the 2,180
nucleotide fragment in this spectrum and the strong background signals indicate 
an upper mass limit for IR-MALDI mass spectrometry of nucleic acids of 
approximately 700 kDa under the current conditions. Accordingly, the double 
stranded 2,180 nucleotide fragment was not observable. 

IR-MALDI mass spectrometry of large RNA molecules also was possible, 
including a 1206 nucleotide RNA in vitro transcript (Figure 2D). The increased 
ion stability for RNA compared to DNA, which is well documented for
UV-MALDI, was not observed for IR-MALDI in the mass range examined. Large
DNA ions, as well as large RNA ions, appeared to be of comparable stability,
stable enough even for TOF analysis in the reflectron mode. The large hump
centered at about 50 kDa is believed to reflect impurities of the sample rather
than metastable fragments. The comparably steep rise of the peak at the low
mass side also testifies to a very limited loss of small neutrals such as single
bases.

One advantage of glycerol as matrix is the superior shot to shot
reproducibility and mass precision (200-400 ppm; see Nordhoff et al., Nucl.
Acids Res. 21:3347-3357 (1993)). These values, originally determined for
proteins, are also valid for analysis of smaller oligonucleotides. Mass accuracy
was mass dependent. Using an external 2 point calibration with angiotensin II
(1047 Da) and bovine insulin (5743 Da), the mass of the 21-mer (6398 Da) was
determined to within ±2 Da of the known mass, i.e., an accuracy of 0.03%
(see Figure 2A). The molecular mass of the 70-mer (theoretical mass
21517 Da) was determined to within ±25 Da, i.e., a mass accuracy of
0.1% from the spectrum of Figure 1C, calibrated with cytochrome C oligomers
(M+, 2M+, 3M+).
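
The relative accuracy follows directly from the mass error and the known mass. The Python lines below are a small arithmetic check using only the values stated for the 21-mer and the 70-mer; the function name is illustrative.

```python
def relative_accuracy_percent(mass_error_da, known_mass_da):
    """Mass error expressed as a percentage of the known mass."""
    return 100.0 * abs(mass_error_da) / known_mass_da


print(f"21-mer: {relative_accuracy_percent(2.0, 6398.0):.2f} %")
print(f"70-mer: {relative_accuracy_percent(25.0, 21517.0):.2f} %")
```

An error of 2 Da at 6398 Da corresponds to about 0.03%, and 25 Da at 21517 Da to roughly 0.1%.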

For each of the ten different samples of high mass DNA analyzed, the
measured mass was within less than about 1% of the theoretical mass derived
from the sequence (see, for example, Figures 2B and 2C). The average mass of
the two single strands was used as the theoretical mass in the case of DNA
restriction enzyme fragments. The masses of the two single strands never
differed by more than about 1%. Only one large mass RNA was measured
(Figure 2D). The measured mass of this RNA was 388,270 Da, whereas the
mass calculated from the gene sequence is 386,606 Da. Given that the sample
most likely is a heterogeneous mixture of the species expected from the gene
sequence, with less abundant products extended by one to three extra
nucleotides (Melton et al., Nucl. Acids Res. 12:7035-7056 (1984)), the actual
mass of the RNA sample is probably about 500 Da larger than that calculated
from the sequence. It would appear, therefore, as though a mass accuracy of
at least about 1% as observed for DNA also can be achieved for RNA.
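
The use of the strand average as the theoretical mass of a restriction fragment can be sketched as follows. This Python example is an illustration under stated assumptions: the 300-nucleotide sequence is hypothetical, the average residue masses are standard reference values that do not come from this disclosure, and 5'-OH and 3'-OH termini are assumed, whereas restriction fragments generally carry 5' phosphates.

```python
# Average masses (Da) of deoxynucleotide residues; standard reference values.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}
END_CORRECTION = -61.96  # 5'-OH / 3'-OH termini (simplifying assumption)
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def strand_mass(seq):
    """Approximate average mass (Da) of a single DNA strand."""
    return sum(RESIDUE_MASS[base] for base in seq) + END_CORRECTION


def reverse_complement(seq):
    """Sequence of the complementary strand, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def strand_average(top_strand):
    """Mean mass of the two strands and their mass difference in percent."""
    top = strand_mass(top_strand)
    bottom = strand_mass(reverse_complement(top_strand))
    mean = (top + bottom) / 2.0
    return mean, 100.0 * abs(top - bottom) / mean


# Hypothetical 300-nucleotide fragment built from a repeated 30-mer.
mean_mass, difference_percent = strand_average("GATTACAGGCCTTAAGCTAGCATCGGATCC" * 10)
print(f"mean strand mass: {mean_mass:.0f} Da, strand difference: {difference_percent:.2f} %")
```

Because complementary strands differ only in base composition, their masses lie close together, and at the resolution available for high mass ions they appear as a single composite signal near the average.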

For external 4 point calibrations of large DNA or RNA molecules with
molecular masses between 100 and 400 kDa, either clusters of cytochrome C,
for example, 10M+, 20M+, 30M+, 40M+, or multimers of an IgG monoclonal
antibody, for example, 2M+, 3M+, 4M+, were used. For analytes exceeding
500 kDa the calibration with IgG monoclonal antibody was more exact. Mass
calibration of unknown DNA fragments using DNA or RNA calibrants may be
more desirable, resulting in more accurate mass determination.
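
External calibration of a time-of-flight spectrum is commonly performed by
fitting the square root of the mass as a linear function of flight time to the
calibrant peaks. The Python sketch below shows how such a multi-point fit could
be set up; it is an assumption for illustration, not the procedure used for the
Figures. The instrument constants and flight times are synthetic, the function
names are hypothetical, and the cytochrome C mass of 12360 Da is the value
quoted elsewhere in this Example.

```python
import math


def fit_tof_calibration(times, masses):
    """Least-squares fit of sqrt(m) = a * t + b for singly charged calibrants."""
    n = len(times)
    if n < 2 or n != len(masses):
        raise ValueError("need at least two (time, mass) calibrant pairs")
    roots = [math.sqrt(m) for m in masses]
    t_mean = sum(times) / n
    r_mean = sum(roots) / n
    sxx = sum((t - t_mean) ** 2 for t in times)
    sxy = sum((t - t_mean) * (r - r_mean) for t, r in zip(times, roots))
    a = sxy / sxx
    return a, r_mean - a * t_mean


def time_to_mass(t, a, b):
    """Convert a flight time to mass with the fitted constants."""
    return (a * t + b) ** 2


# Synthetic instrument (made-up constants) used only to generate flight times.
A_TRUE, B_TRUE = 0.85, -1.5


def synthetic_time(mass_da):
    return (math.sqrt(mass_da) - B_TRUE) / A_TRUE


CYTOCHROME_C_DA = 12360.0
calibrant_masses = [n * CYTOCHROME_C_DA for n in (10, 20, 30, 40)]
calibrant_times = [synthetic_time(m) for m in calibrant_masses]

a, b = fit_tof_calibration(calibrant_times, calibrant_masses)
recovered = time_to_mass(synthetic_time(318480.0), a, b)
print(f"recovered mass: {recovered:.1f} Da")
```

Because the synthetic data are noise free, the fit recovers the input mass; with
real spectra the residuals of the calibrant peaks indicate the attainable
accuracy.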

Experiments to evaluate the sensitivity of IR-MALDI mass spectrometry
of large nucleic acids with glycerol as matrix were carried out with a PCR
product of an unknown sequence having approximately 515 nucleotides; the
mass was measured to be 318,480 Da. For these measurements, glycerol not
subjected to ion exchange purification was used. The spectra show dominant
signals of the double stranded moiety. The dissociation of the double strands in
samples prepared with purified glycerol tentatively is attributed to an
acidification of the glycerol by the protons exchanged for the cations, although
additional parameters may be involved in the double strand dissociation under
IR-MALDI conditions.

The starting concentration for the dilution experiment was 0.6 pmol/µl as
determined by UV spectrophotometry. The mass spectra were obtained by
loading different amounts of sample onto the target (Figure 3). For the single
shot mass spectrum shown in Figure 3A, 300 fmol of the PCR product was
loaded. The quality of this spectrum, with an S/N ratio better than 100 and a
mass resolution of 65 (FWHM) for the double strand, indicates that the analyte
to matrix ratio (A/M) of about 10^[illegible] is well suited for an analyte of
this size (about 320 kDa).

The mass spectrum in Figure 3B was obtained using a 3 fmol total load
(A/M about 2 x 10^[illegible]). A strong background signal dominated the low
mass range. Total signal intensity, mass resolution (about 25 FWHM for the
ds-ion signal), and S/N ratio were significantly degraded compared to Figure 3A.
Mass determination still was possible with an accuracy of about 1%. The spectrum
in Figure 3C was obtained from a very small sample volume, forming an
approximately 270 µm diameter sample spot on the target, and a total sample
load of only 300 attomol (A/M about 8 x 10^-[illegible]). Such small sample
volumes can be realized either by dispensing the small volumes using
micropipets (see, for example, Little, Anal. Chem. 69:4540-4546 (1997)), or by
preparing the analyte in a standard microliter volume of a suitable
glycerol/water mixture. In the latter case, the water is evaporated off prior to
or upon insertion of the sample into the vacuum. The poor mass resolution of
only about 10 classifies 300 attomol of analyte as the limit for the particular
instrument and detection system used for a mass accuracy of better than about
3%. Compared to values reported for UV-MALDI mass spectrometry (Tang et al.,
Rapid Commun. Mass Spectrom. 8:727-730 (1994); see Figures 5 and 6), the
sensitivity obtained here for IR-MALDI mass spectrometry demonstrates an
improvement of at least about 2 to 3 orders of magnitude for nucleic acids of
this size.
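
The sample loads quoted above follow from simple amount-of-substance
arithmetic. The Python sketch below is illustrative only: it assumes that the
stock concentration of 0.6 pmol is per microliter, and the helper names
volume_for_load_ul and molecule_count are hypothetical.

```python
AVOGADRO = 6.02214076e23  # molecules per mole


def volume_for_load_ul(load_mol, conc_pmol_per_ul):
    """Volume of solution (in microliters) that contains the requested load."""
    return load_mol / (conc_pmol_per_ul * 1e-12)


def molecule_count(load_mol):
    """Number of analyte molecules in a given molar load."""
    return load_mol * AVOGADRO


STOCK_PMOL_PER_UL = 0.6  # assumed unit: pmol per microliter

for label, load_mol in (
    ("300 fmol", 300e-15),
    ("3 fmol", 3e-15),
    ("300 amol", 300e-18),
):
    volume = volume_for_load_ul(load_mol, STOCK_PMOL_PER_UL)
    print(f"{label}: {volume:.4g} uL of undiluted stock, "
          f"{molecule_count(load_mol):.2e} molecules")
```

With these assumptions, 300 fmol corresponds to 0.5 µl of undiluted stock, and
300 attomol to roughly 1.8 x 10^8 molecules, so the smaller loads require
dilution before dispensing.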

EXAMPLE 2 

Performance of IR matrix-assisted laser desorption/ionization mass spectrometry 

The performance characteristics of two lasers emitting in the mid
infrared, an Er-YAG laser (2.94 µm wavelength, 80-90 ns pulse width) and an
Er-YSGG laser (2.79 µm wavelength, 80 ns pulse width), in matrix-assisted
laser desorption/ionization mass spectrometry (IR-MALDI-MS) of
biological macromolecules were studied. Glycerol and succinic acid were used
as matrices. In IR-MALDI, sample consumption per laser shot typically exceeds
that of UV-MALDI by about two orders of magnitude. Using glycerol as matrix,
the reproducibility of the ion signals from shot to shot is comparable to the best
values achieved in UV-MALDI. The same holds true for the precision and
accuracy of the mass determination. For succinic acid all these values are
significantly worse, due to the strong sample heterogeneity typically found in
dried droplet preparations. Metastable fragmentation is comparable for UV- and
IR-MALDI in the low mass range, but is significantly less for the IR in the mass
range above ca. 20 kDa, leading to an improved mass resolution and an
extended high mass limit for IR-MALDI.

In this Example, results and performance data for IR-MALDI analysis
obtained with Er lasers emitting at 2.94 µm and 2.79 µm, and the applicability
of delayed extraction for improved mass resolution, are presented. In
particular, it is demonstrated that the extent of metastable fragmentation is
different for the mass resolution of high molecular mass analytes.
EXPERIMENTAL

The experiments were carried out with an in-house built, single stage
reflectron TOF mass spectrometer of 3.5 m equivalent flight length. The mass
spectrometer can also be used in linear mode. Unless specifically mentioned,
the experiments reported here have been carried out in reflectron mode. In the
split extraction source, ions are accelerated through total potential differences
of 12-20 kV using either static or delayed extraction. In the delayed extraction
mode, a maximum potential difference of 8 kV can be switched; the
minimum delay for ion extraction is 120 ns. No arcing was observed under
these operating conditions in the positive or negative ion modes for any of the
matrices used. A Venetian blind secondary electron multiplier (EMI R2362)
with a conversion dynode mounted 10 mm in front of the cathode (ion impact
energy 20-27 kV, depending on ion mass), or a Chevron microchannel plate
detector (Galileo Co., Sturbridge, MA, USA), is used for ion detection. Signals
were processed by a transient recorder with a time resolution of 2.5 ns (LeCroy
9450A) in the majority of the experiments. For the high mass-resolution
experiments a LeCroy 9348La recorder with a time resolution of 0.5 ns was
used. The data are transferred to a PC for storage and further evaluation. The
instrument is equipped with two infrared lasers, one emitting at 2.94 µm
(Er-YAG; Fa. Spectrum GmbH, Berlin, Germany; τ = 80-90 ns, energy stability ca.
±2-4% from shot to shot) and a second radiating at 2.79 µm (Er-YSGG;
Schwartz Electro Optics, USA; τ = 80 ns; energy stability ca. ±2%). A
frequency tripled Nd-YAG laser, emitting in the UV at 355 nm (τ = 16 ns), is
used for direct comparison between IR- and UV-MALDI. Single laser pulses are
focused to a spot diameter of ca. 200 µm (IR) and 100 µm (UV) on the sample
under an angle of 45°. Samples are observed in situ with a CCD camera of ca.
5 µm resolution. The stainless steel substrate can be cooled with liquid nitrogen
to a temperature of ca. 150-170 K. Its temperature is monitored by a
thermocouple with an accuracy of ±5 K.

SAMPLE PREPARATION

A wide variety of small molecules can be used as matrices in IR-MALDI
as described previously. Succinic acid (solid matrix) and glycerol (fluid or
frozen, solid) were preferred. DHBs (2,5-dihydroxybenzoic acid mixed with
10% 2-hydroxy-5-methoxybenzoic acid), a common matrix in UV-MALDI, also
functions in IR-MALDI and was used for comparison in some cases.
Additionally, mixtures of compounds, e.g., succinic acid/DHBs or succinic
acid/TRIS (Tris-hydroxymethylaminomethane), were found suitable. Solid matrix
samples were prepared using the standard dried droplet method by mixing ca.
1-3 µL of a 10^-4 to 10^-5 M aqueous analyte solution with 1-5 µL of a 30 g/L
matrix solution on the target and subsequently drying in a stream of cool air.

For glycerol, analytes were either dissolved directly at a concentration of
0.5-10 g/L, or the glycerol was mixed with an aqueous analyte solution to a
final analyte-to-glycerol molar ratio of 2 x 10^[illegible] to 1 x 10^[illegible],
depending on the mass of the analyte. A volume of typically 1 µL is applied to
the stainless steel substrate and smeared out evenly over an area of ca.
3-4 mm^2 to form a homogeneous, transparent thin layer. If an aqueous analyte
solution is mixed with the glycerol, the water must be evaporated off before
sample introduction into the mass spectrometer, usually at a pressure of
10 2 -1 Pa. Samples are either inserted directly into the mass spectrometer or
are cooled down to a temperature of ca. 150-170 K in liquid nitrogen before
insertion.

For the matrices investigated, IR-MALDI was found to be quite tolerant
with respect to salts and buffers. Spectra of samples containing NaCl at
concentrations of up to 200 mM, saccharose up to 20% (w/v) or Tris/HCl buffer
up to 100 mM have, for example, been obtained without significant loss in
spectral quality with succinic acid as well as glycerol.
RESULTS 

Sample consumption and analytical sensitivity 

The analytical sensitivity of MALDI can be limited by either the minimal
concentration of the analyte solution used for the analysis or the total amount
of analyte available. For the actual measurement the total sample volume used
for the preparation and the analyte-to-matrix ratio in the sample can be adjusted
within certain limits to accommodate a given situation. In this section typical,
as well as limiting, values for these quantities in IR-MALDI are presented and
compared to the corresponding UV-MALDI values. As an introduction a few
general differences between IR- and UV-MALDI will be discussed.

Given comparable laser spot sizes on the sample, 100-1000 times more
material is desorbed by the IR as compared to the UV laser beam, because of
the correspondingly smaller absorption coefficient and higher penetration depth
of the radiation into the sample. Under the experimental conditions described
here the single shot ion signals in the IR and UV typically were of comparable
intensity. With that much more material ablated per laser pulse in IR-MALDI,
the material is either primarily removed as larger clusters and small particles, as
actually observed experimentally, and/or the ion yield is smaller by two to three
orders of magnitude.

The intensity of low mass signals, the signal-to-noise ratio and mass
resolution are the main criteria for the quality of recorded mass spectra. The
optimal molar analyte-to-matrix ratio on the target for the signal intensity and
quality of spectra depends on the molecular mass of the analyte. For analytes
with molecular masses below ca. 50 kDa a ratio of 2 x 10^[illegible] to
2 x 10^[illegible] was found to be optimal. A ratio of 10^[illegible] was found
to work best for analytes with masses exceeding 50 kDa. Hence, in particular in
the low mass range, the optimal analyte concentration in IR-MALDI exceeds that
typically used in UV-MALDI by approximately one to two orders of magnitude.
Spectra of reasonable quality can be obtained for analyte-to-matrix ratios down
to 2 x 10^-6 to 10^[illegible], depending on the molecular mass of the analyte.
This corresponds to a sample consumption per laser shot of 1-50 fmol. In
routine applications, spectra of IgG monoclonal antibodies (ca. 150 kDa) have
been obtained with reasonable quality from 10^[illegible] M aqueous solutions
using either succinic acid or glycerol as matrix. This compares to values of
only a few attomole in UV-MALDI.

The results demonstrated the attainable sensitivity for a
lysozyme/glycerol preparation. Here, 5.5 x 10^[illegible] µL of a
1.2 x 10^[illegible] mol/L frozen glycerol solution, corresponding to a total
amount of prepared lysozyme of ca. 0.7 fmol, were used for the preparation.
This corresponds to a molar A/M ratio of ca. 10^[illegible]. The frozen sample
on the target had a diameter of about 1.5 times that of the laser spot on the
sample. After the ten laser exposures summed for the spectrum some sample was
still left on the target. Consequently, sample consumption on average was less
than 70 attomol per single spectrum. However, when using such low A/M ratios
the mass resolution is down at m/Δm = 10-20. The signal-to-noise (S/N) ratio
has also degraded substantially and the low mass background signals become
excessive. For UV-MALDI a sample consumption in the high zeptomole range has
been reported (see Jespersen et al. (1996) p. 217 in Mass Spectrom. in Biol.
Sci., Burlingame, Ed.).

In IR-MALDI there is also a pronounced dependence on the A/M ratio of
the yield of (non-specific) analyte oligomers or multiply charged ions. These
tendencies are more pronounced in IR- as compared to UV-MALDI. A mass
spectrum of hen egg lysozyme from a preparation with an A/M ratio of
2 x 10^[illegible] was obtained. Homo-oligomers of lysozyme up to the 25-mer
(ca. 500 kDa) were identified in this spectrum. Conversely, signals of multiply
charged analyte ions become dominant in the spectra for A/M ratios below a
value of ca. 10^-[illegible], whereas oligomer signals decrease to values below
the noise level. These trends have also been observed with succinic acid and
the water of hydration as matrices (Sadeghi (1997) Rapid Commun. Mass
Spectrom. 11:393), and are particularly pronounced for analytes with molecular
masses exceeding 50 kDa. Given the observed high yield of analyte
homo-oligomers, the details of the distribution, in particular the most abundant
oligomer signal, can be influenced significantly by changes in the ion
extraction conditions, e.g., by using low (soft or mild) or high (hard or harsh)
ion extraction fields in a two stage ion source. Low extraction fields will
shift the distribution towards a higher degree of oligomerization. This
observation is a strong indication of gas phase processes in the expanding
desorption plume, besides possibly reflecting differences in the acceptance of
the spectrometer under different extraction conditions.
Reproducibility 

For Er-YAG lasers presently used, the pulse shot-to-shot stability is ca.
±2%, a value quite comparable to that of the best UV lasers, and, thus, does
not contribute substantially to signal variation. Laser energy fluctuations from
shot to shot, therefore, play only a minor role with these lasers. Another source
of signal variation from shot to shot is the sample homogeneity, which depends
on the matrix and the preparation technique. Solid matrices such as succinic
acid typically form heterogeneous microcrystalline patterns. High quality mass
spectra can only be achieved from 'sweet spots'. For some matrices like DHBs
(2,5-dihydroxybenzoic acid mixed with 10% 2-hydroxy-5-methoxybenzoic acid),
this is also true for UV-MALDI. In UV-MALDI 50-100 spectra can be obtained
from any 'sweet spot', in contrast to only 3-4 spectra from a given 'sweet spot'
in IR-MALDI, because of the much larger sample consumption per exposure.

Liquid matrices such as glycerol form very homogeneous layers, and
spectra of comparable quality can be obtained from all locations across the
sample. Also, the surface of these liquid samples recovers after every laser
shot, and more than 500 spectra of almost identical quality can be obtained from
the same spot of a typical preparation. The spectrum of hen egg lysozyme was
obtained after more than 250 shots on the same location at a laser repetition
rate of 2 Hz. No significant differences in signal intensity, mass resolution or
S/N ratio, or in oligomer distribution, were observed between the early and
late exposures. As a result, reproducibility of IR-MALDI spectra was found to
be comparable to, if not better than, that for UV-MALDI preparations if glycerol
is used as matrix. For solid matrices such as succinic acid, reproducibility of
ion signals from shot to shot may become a problem, and considerable experience
of the operator is often required for good results.
Fragmentation 

Fragmentation is also an important parameter in MALDI-MS. Generally
the yield of so-called 'prompt' fragments, generated during the desorption and
ionization process on a time scale short compared to the ion extraction time, is
very low in UV- as well as IR-MALDI. These fragments are detected at their true
(fragment) mass in linear as well as reflecting time-of-flight (TOF)
spectrometers. This is also true under delayed extraction with delay times of
ca. 1 µs or less. In IR-MALDI, a somewhat increased yield for such prompt
fragments of oligonucleotides has been observed, but the overall yield was very
low nonetheless. Metastable ion decay, on the microsecond to hundreds of
microseconds time scale in the field free region of the TOF mass spectrometer,
is much more important. On the one hand it degrades mass resolution in
reflectron (reTOF) instruments, but on the other hand it can be used for
structural analysis in the post-source decay (PSD) mode. No significant
differences between UV- and IR-MALDI have so far been found for the
metastable fragmentation of peptides and proteins in the mass range up to ca.
20 kDa. For analytes with a molecular mass above 20 kDa a markedly different
metastable fragmentation has been observed. reTOF spectra of an IgG
monoclonal antibody (mouse, MW ca. 150 kDa) were obtained with UV- (matrix:
DHBs) and IR-MALDI (matrix: succinic acid; matrix: glycerol).
The base peak of the parent ion had a rather symmetrical shape in both
IR-MALDI spectra, whereas it showed a strong tail on the low mass side in the
UV-MALDI spectrum, testifying to a significant amount of metastable decay. The
peak width (full width at half maximum, FWHM) decreases from a value of 2000
Da in the UV spectrum to 1000 Da in the IR spectrum with succinic
acid as matrix, and merely 700 Da if glycerol is used as matrix. It was also
noticeable that the analyte signals ride on a substantially elevated baseline in
the UV-MALDI spectrum, which results from delayed fragmentation somewhere
within the ion source. No such baseline distortions were observed in the
IR-MALDI spectra if the desorption fluence remained within a range of ca. 1-1.5
times the ion detection threshold fluence.
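
Mass resolution is expressed in this Example as m/Δm, with Δm the full width at
half maximum. As a small worked example, the Python sketch below converts the
three peak widths reported above into resolving powers, taking the antibody
mass as 150 kDa (the approximate value given in the text). The function name
resolving_power is an assumption introduced for illustration.

```python
def resolving_power(mass_da, fwhm_da):
    """Resolving power m/dm from the peak position and its FWHM."""
    if fwhm_da <= 0:
        raise ValueError("FWHM must be positive")
    return mass_da / fwhm_da


IGG_MASS_DA = 150_000.0  # approximate mass quoted in the text

for label, fwhm_da in (
    ("UV-MALDI, DHBs", 2000.0),
    ("IR-MALDI, succinic acid", 1000.0),
    ("IR-MALDI, glycerol", 700.0),
):
    print(f"{label}: m/dm = {resolving_power(IGG_MASS_DA, fwhm_da):.0f}")
```

The three widths correspond to resolving powers of 75, 150 and about 214,
respectively.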

It has been reported in the literature that in UV-MALDI the degree of
metastable decay increases significantly with degrading source back pressure.
Two spectra of an IgG monoclonal antibody (mouse) were obtained with (a) UV-
and (b) IR-MALDI at a source back pressure of 4 x 10^[illegible] Pa, as
compared to a back pressure of 4 x 10^-4 Pa used to obtain the spectra
described above. The UV-MALDI spectrum exhibited signals with a substantially
increased tailing to the low mass side, particularly visible for the oligomers
2M and 3M2+, whereas no such tailing was seen in the IR-MALDI spectrum.

Another observation is the dependence of metastable fragmentation on
the analyte-to-matrix ratio. In reTOF UV-MALDI, too high an A/M ratio will
usually result in a degraded S/N ratio and a loss of mass resolution. This was
shown with spectra for cytochrome c with DHB as the matrix. A higher A/M
ratio of 10^-3 used to obtain the spectrum resulted in a strong, low-mass tail of
the peaks, again stronger for the dimer as compared to the parent ion peak, but
not seen in the spectrum obtained from a sample with an A/M ratio of
10^[illegible]. The IR-MALDI spectrum of the same sample showed no such tailing,
again indicating substantially less metastable fragmentation. For IR-MALDI of
cytochrome c using the water of hydration as 'intrinsic' matrix the molar A/M
ratio is even higher (ca. 5 x 10^-3), yet no significant metastable decay was
observed (Berkenkamp et al. (1996) Proc. Natl. Acad. Sci. USA 93:7003).

It is a generally held notion that collisions of ions with matrix neutrals in
the plume and with residual gas molecules in the spectrometer are the major
cause of metastable fragmentation (see, e.g., Spengler et al. (1992) J. Phys.
Chem. 96:9678). Considering that much more material is desorbed in IR versus
UV-MALDI, resulting presumably in a more extended plume, and that in addition
proportionally more of the absorbed laser energy goes into the analyte molecule
in the IR, the finding of much less metastable fragmentation in IR-MALDI under
all the different conditions presented above was not expected. Contrary to
intuition, IR-MALDI seems to be a milder method than UV-MALDI.
Accessible mass range 

The lower degree of fragmentation gives IR-MALDI an advantage over
UV-MALDI for the analysis of very high mass analytes, particularly when an ion
mirror is used. Not only does this lead to stronger signals of large parent
molecular ions, it also, and more importantly, allows the use of higher laser
fluences up to about twice the ion detection threshold fluence without the
deterioration in spectral quality that would be the case in UV-MALDI under such
conditions. This increases the high mass signals even further.

A spectrum of gramicidin-S-synthetase of mass 510 kDa, prepared in
glycerol matrix from an aqueous solution containing 50 mM Tris/HCl, 18%
(w/v) saccharose and 5 mM dithiothreitol, was obtained with a quite acceptable
S/N ratio and a mass resolution of m/Δm = 50. No signals of this analyte could
be obtained with UV-MALDI under a variety of conditions tried.

A mass spectrum of an IgG monoclonal antibody (mouse) demonstrated
that IR-MALDI in combination with a TOF mass analyser can be used for the
analysis of biomolecules with molecular weights exceeding 1 MDa. Multiply
charged ions of the 13-mer homo-oligomer of ca. 2 MDa mass could
unambiguously be identified in the spectrum, and signals of ions of other
oligomers with m/z values as high as 900 000 were also clearly identified in the
spectrum.

Mass resolution

Delayed ion extraction (DE) is used for enhanced mass resolution in
UV-MALDI-TOF MS. It was not immediately clear whether DE would be as
advantageous in IR-MALDI as well, considering the much higher desorbed
sample amount and the possibly substantially different plume expansion
dynamics. In fact, for peptides in the 1000 Da mass range, the mass resolution
is only about 200 (FWHM), down from about 1000 for UV-MALDI under
otherwise comparable conditions, indicating differences in the ion generation
process. Nonetheless DE gave equal mass resolution for IR- and UV-MALDI
within the accuracy of the measurement. This was demonstrated with a reTOF
spectrum (sodiated gramicidin-S, MW 1164.5 Da) obtained with the Er-YAG
laser at 2.94 µm and 80 ns pulse width and succinic acid as matrix. The mass
resolution in this spectrum was 10 000, corresponding to a width of the
[illegible] peaks of 3.5 ns, limited by the time resolution of the dual MCP
detector (3.0 ns). Using a reTOF, a mass resolution of 9500 for melittin
(2846 Da) and one of 1500 for cytochrome c (12360 Da) were obtained. Thus an
enhancement in resolution by factors of ca. 50 for peptides and of 4 for
cytochrome c was achieved.
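
Because mass scales with the square of the flight time in a TOF analyser, a
temporal peak width Δt translates into a resolving power m/Δm = t/(2Δt). The
Python sketch below estimates this for the sodiated gramicidin-S peak. It is a
rough estimate under stated assumptions: a total acceleration potential of
16 kV (a value within the 12-20 kV range given for the instrument), a singly
charged ion, and the full 3.5 m equivalent flight length traversed at the final
velocity. The function names are hypothetical.

```python
import math

ELEMENTARY_CHARGE_C = 1.602176634e-19
DALTON_KG = 1.66053906660e-27


def flight_time_s(mass_da, accel_volts, path_m, charge=1):
    """Drift time over path_m for an ion accelerated through accel_volts."""
    kinetic_energy_j = charge * ELEMENTARY_CHARGE_C * accel_volts
    velocity = math.sqrt(2.0 * kinetic_energy_j / (mass_da * DALTON_KG))
    return path_m / velocity


def resolution_from_peak_width(flight_time, peak_width):
    """m/dm = t / (2 * dt), which follows from m being proportional to t**2."""
    return flight_time / (2.0 * peak_width)


# Assumed operating point: 16 kV, 3.5 m equivalent path, 3.5 ns peak width.
t = flight_time_s(1164.5, 16_000.0, 3.5)
print(f"flight time: {t * 1e6:.1f} us")
print(f"m/dm: {resolution_from_peak_width(t, 3.5e-9):.0f}")
```

Under these assumptions the flight time is close to 68 µs and the resolving
power is of the order of 10^4; a different acceleration potential shifts the
estimate with the inverse square root of the voltage.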

In the high mass range, mass resolution in the linear TOF with static ion
extraction at 20 kV is limited to a value of ca. 50, equal for IR- and UV-MALDI.
In both cases the mass resolution is determined by the distribution of initial ion
velocities and kinetic energies. Using DE, the mass resolution could be improved
by a factor of 3 to equal values of ca. 150 for IR- as well as UV-MALDI.

For analytes exceeding 50 kDa, mass resolution in IR-MALDI can,
however, be improved even more with a reTOF analyser, in contrast to
UV-MALDI. As demonstrated by the spectra discussed above (reTOF spectra of an
IgG monoclonal antibody), the strongly decreased metastable fragmentation in
IR-MALDI resulted in a peak width of only 700 Da for the parent ion peak of a
monoclonal antibody desorbed with the IR laser out of a glycerol matrix,
compared to a peak width of ca. 2000 Da for the UV-MALDI spectrum. The
peak width of 700 Da, corresponding to a resolution of about 200, was
obtained for all experimental conditions tested. Using either a matrix other
than [...] mass resolution was observed in IR-MALDI reTOF spectra [...]
large masses, in agreement with observations made in UV-MALDI. [...]
mass spectra of an IgG monoclonal antibody show that the peak width of 700
Da reflects the [...] of various [...] states of the
molecule. The instrumental mass resolution has, therefore, been even
[...] higher. This assumption is supported by a [...]
which [...] was observed at mass 224 kDa with a peak width of [...],
corresponding to a mass resolution of 300.

Mass accuracy and precision

Similar to the reproducibility of the intensity of ion signals in IR-MALDI, the
precision of the mass determination, as given by the standard deviation of
sequential measurements, depends on the matrix [...] morphology [...]
on the shot-to-shot variation of the laser pulse [...]. [...] prompt [...]
acid and dried droplet preparations, mass precision is typically 400-500 ppm
for molecular weights up to 150 kDa. [...] precision of the mass
determination. If, for example, the [...] raised intentionally from threshold
[...] to 1.5 [...] for cytochrome c [...] total flight [...]
energy. For IR lasers of current design with glycerol as matrix, a precision of
the mass determination of 200-400 ppm can be achieved up to a mass of
approximately 150 kDa. For analytes below 30 kDa this precision is lower by
about one order of magnitude than the values typically obtained in prompt
extraction UV-MALDI. In the high mass regime, precision in IR-MALDI was
found to be better by at least a factor of 2, most likely due to the enhanced
mass resolution of IR-MALDI in this mass range.
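
Precision in the sense used here, the standard deviation of sequential mass
determinations, can be expressed in ppm relative to the mean. The Python sketch
below illustrates the calculation. The five readings are hypothetical values
chosen near the cytochrome c mass and are not measured data; the function name
precision_ppm is likewise an assumption.

```python
import statistics


def precision_ppm(masses_da):
    """Sample standard deviation of repeated readings, in ppm of their mean."""
    if len(masses_da) < 2:
        raise ValueError("need at least two readings")
    return statistics.stdev(masses_da) / statistics.mean(masses_da) * 1e6


# Hypothetical sequential readings (Da), for illustration only.
readings = [12358.1, 12362.4, 12360.9, 12356.7, 12363.0]
print(f"precision: {precision_ppm(readings):.0f} ppm")
```

For these made-up readings the scatter is about 2.7 Da, i.e., a precision of
roughly 220 ppm, which falls within the 200-400 ppm range quoted for glycerol.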

The mass accuracy for prompt extraction IR-MALDI was determined by
external calibration with 3 well known standards (low mass range:
angiotensin (human), melittin, bovine insulin; high mass range: cytochrome c
(horse heart), apo-myoglobin (horse), subtilisin Carlsberg (Bacillus subtilis)). In
the mass range up to 30 kDa, 5 sum spectra of 15 single shots each were used
to obtain the calibration factors from the calibration spectrum and the mass of
the 'unknown' in the second spectrum. For both matrices, succinic acid and
glycerol, the absolute mass accuracy has been found to be 10^2 to 5 x 10^3 ppm,
depending on molecular mass. For proteins up to 40 kDa the mass accuracy of
100-500 ppm is in good agreement with the previously described numbers for
UV-MALDI (static extraction). For analytes exceeding 40 kDa accuracy is 1-5
x 10^3 ppm.
CONCLUSIONS

As judged by the lesser degree of metastable fragmentation compared to
the UV, IR-MALDI appears to be the 'milder' of the two techniques for
generating biomolecular ions. Glycerol or an equivalent material is the matrix of
choice for many applications because of its superior reproducibility in
comparison to solid matrices. Of the two lasers tested in this study, the
Er-YAG laser performs slightly better than the Er-YSGG laser for glycerol and
substantially better for succinic acid as matrix. The lesser metastable
fragmentation makes IR-MALDI also particularly well suited for the analysis of
high mass analytes in the reTOF mode. Delayed ion extraction works well with
IR-MALDI, with results comparable to UV-MALDI.
EXAMPLE 3

Detection of double-stranded DNA by IR-MALDI mass spectrometry

In this Example, the use of IR- and UV-MALDI-MS for the analysis of
ds-DNA using glycerol and ATT, respectively, as matrices is described.
This example shows that IR-MALDI can be used effectively as a diagnostic tool.
IR-MALDI, using a glycol matrix, such as a glycerol matrix, yielded excellent
results for larger double stranded DNA. These results were achieved by adjusting
the ionic strength through the addition of salts. Very little fragmentation and a
routine sensitivity in the sub-picomole range were observed in IR-MALDI when
double stranded analytes harboring 70 base pairs or more were probed.

Double stranded DNA molecules ranging from 9 kDa to over 500 kDa
were desorbed and analyzed by MALDI TOF mass spectrometry. IR-MALDI with
glycerol as matrix yielded excellent results for larger double stranded DNA by
adjusting the ionic strength through the addition of salts. Very little
fragmentation and a routine sensitivity in the sub-picomole range were observed
in IR-MALDI when double stranded analytes harboring 70 base pairs or more
were probed. In the lower mass range (up to approx. 70 bp), UV-MALDI with
6-aza-2-thiothymine as matrix was the ionization method of choice because it
allowed specific double stranded complexes containing relatively few base pairs
to be desorbed in intact form. In this mode an essentially quantitative detection
of the double stranded form was observed for a 70-mer. The UV-MALDI was
accompanied by a significant fragmentation and a resulting reduced sensitivity
and mass resolution.

The methods described demonstrate that MALDI-MS, particularly
IR-MALDI, can be used for the analysis of large DNA/DNA and DNA-protein
complexes.

Materials and Methods 

Samples. The synthetic oligodeoxynucleotides were obtained from Pharmacia
Biotech (Uppsala, Sweden). To form double stranded oligodeoxynucleotides,
individual complementary single strands were mixed in a 1:1 ratio (typically
5-10 pmol/µl in water), heated to 75°C for 2 minutes and cooled to 10°C over 30
minutes. The synthetic oligodeoxynucleotides and their mixtures were adjusted
to 5 mM EDTA and 2 M NH4-acetate, and precipitated with two volumes of a 1:1
mixture of ethanol and 2-propanol. Samples were re-dissolved in water to a
concentration of 10-20 pmol/µl.

The DNA plasmids pBR322 and Bluescript KS+ were purified from E. coli
B cells by a Qiagen midiprep kit according to the supplier's protocol (Qiagen
GmbH, Hilden, Germany), and subjected to restriction enzyme digests. The
restriction enzymes were obtained from New England Biolabs GmbH
(Schwalbach/Taunus, Germany), and used according to the supplier's
suggestion, except that the addition of bovine serum albumin was omitted in the
case of EcoRV digest. The restriction enzyme digested DNA was adjusted to 5
mM EDTA and 2 M NH4-acetate, precipitated with 2 volumes of ethanol, and
finally re-dissolved in water to approx. 0.5 pmol/µl. All restriction enzyme
digests were verified by agarose gel electrophoresis.

MALDI-MS analysis. Positive ion IR-MALDI-MS experiments were carried out 
with an in-house built linear/single stage reflectron time-of-flight (TOF) mass 
spectrometer of 3.5 m equivalent flight length (reflectron mode). In the split 
extraction source, ions were accelerated through total potential differences of 
16-25 kV. A Venetian blind secondary electron multiplier with a conversion 
dynode in front of the cathode (total ion impact energy 20-40 kV, depending on 
ion mass) was used for ion detection. Unless specifically mentioned, all 
experiments were carried out in reflectron mode with static ion extraction. An 
Er-YAG laser emitting at 2.94 µm (Spektrum GmbH, Berlin, Germany; τ = 80-90 
ns, energy stability ca. ± 2-4% from shot to shot) was used. The laser pulse 
is focused to a spot diameter of ca. 140 µm on the sample under an angle of 
45°. The instrument is described elsewhere herein (see, also Berkenkamp et 
al., Rapid Commun. Mass Spectrom. 1997, 11, 1399-1406). For IR-MALDI, 
typically 0.5-1 µL of the glycerol matrix was mixed on the stainless steel targets 
with ca. 0.5-1 µL of an aqueous analyte (typically 0.5 pmol/µl) to a molar 
analyte-to-glycerol ratio of 10'-10=. To stabilize the ds-DNA, NH4-acetate or 
Tris-HCl (pH 8.0) were added to a final concentration of about 40 mM. Before 
sample introduction into the mass spectrometer most of the water was 
evaporated off in the rough vacuum at a pressure 10, -1 Pa. In order to have a 
more controlled and gradual evaporation of the remaining water under high 
vacuum conditions, the target with the samples was cooled by plunging it into 
liquid nitrogen before insertion into the mass spectrometer. 
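
The molar analyte-to-glycerol ratio that follows from the stated volumes and analyte concentration can be estimated with a short calculation. The sketch below is illustrative only: the density and molar mass of glycerol are assumed handbook values that are not given in this Example, and the function names are hypothetical.

```python
# Sketch: molar analyte-to-glycerol ratio for an on-target preparation.
# Glycerol density and molar mass are assumed handbook values, not figures
# taken from this Example.

GLYCEROL_DENSITY_G_PER_ML = 1.26       # assumed, near room temperature
GLYCEROL_MOLAR_MASS_G_PER_MOL = 92.09  # C3H8O3


def glycerol_moles(volume_ul: float) -> float:
    """Moles of glycerol contained in a volume given in microliters."""
    grams = volume_ul * 1e-3 * GLYCEROL_DENSITY_G_PER_ML  # uL -> mL -> g
    return grams / GLYCEROL_MOLAR_MASS_G_PER_MOL


def analyte_moles(volume_ul: float, conc_pmol_per_ul: float) -> float:
    """Moles of analyte in a volume (uL) at a concentration in pmol/uL."""
    return volume_ul * conc_pmol_per_ul * 1e-12


def analyte_to_glycerol_ratio(matrix_ul: float, analyte_ul: float,
                              conc_pmol_per_ul: float) -> float:
    """Molar ratio of analyte to glycerol after mixing on the target."""
    return analyte_moles(analyte_ul, conc_pmol_per_ul) / glycerol_moles(matrix_ul)


if __name__ == "__main__":
    # Volumes span the 0.5-1 uL ranges quoted above; 0.5 pmol/uL analyte.
    for matrix_ul, analyte_ul in [(0.5, 0.5), (1.0, 1.0), (0.5, 1.0), (1.0, 0.5)]:
        ratio = analyte_to_glycerol_ratio(matrix_ul, analyte_ul, 0.5)
        print(f"{matrix_ul} uL glycerol + {analyte_ul} uL analyte: {ratio:.1e}")
```

Under these assumptions the ratio comes out on the order of 10⁻⁸ for all four volume combinations.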

Positive ion UV-MALDI-MS was carried out on a prototype Vision 
2000 (ThermoBioanalysis, Hemel Hempstead, UK) mass spectrometer in the 
linear mode using delayed ion extraction. The ions were accelerated 
to 20 keV in the ion source and additionally post-accelerated through 10-19 
keV (depending on ion mass) by a conversion dynode in front of the 
electron multiplier. This instrument is described in detail elsewhere (Gruic-

consul f T L 199? ' 32 ° 84 - 9 "- — D ' ~* 

c ousted or e.ther ca. ,0 NH.-loaded cation exchange beads .Nordhoff era, 

ZT in 2 Z" n - SPe " r0m - 1 " 2 - 6 - 771 - 6 ' addad » o-W * «og/i3- 

50% acaln tTTr " ^ ' " ° M ° *" 10 ™ M in 

15 20 omo, ^ MSeS ' °' 5 ^ " °* " a <"— — -.ution ,,0. 

h^h :nTe Were miXed " maUiX SO,U,i ° n - Th °- h - 
a iow di « COnCemra a ' SO US6d 3 - HPA * «*- «o 

: r on wi,h ,he att pre ~- - — - - * 

Results 
IR-MALDI-MS 

As shown herein, IR-MALDI-MS with glycerol or other such composition 

nl T r arkab ' e COmbina, ' 0n **" ™^ - -ids. 
Single stranded nucleic acids containing over 2000 nucleotides were 
successfully detected. Destabilization of the DNA double strand by the absence 

e? 'I : pure 9lycero ' ma,rix and by ,he ,r — - ° 

response tor the predominant observation of the singie stranded fl of tL 

addif r, EXamPle " d ° Ubla «™*> - 'tabbed in soiution by the 
d„,on of sa„s. in particuia, the addition o, NH.-acetate or Tris-HC, the 

hnH~ S0 ' Uti0n ,C ab ° Ut 50% " - ~ — amine a, 

the pH of 8.0 was used. 

When pure glycerol was used as matrix, the barely resolved individual 
single stranded 70-mers form the base peak in the spectrum. After the addition 
of either NH4-acetate or Tris-HCl at pH 8.0 to the sample preparation, the signal 
of the double stranded DNA forms the base peak in the spectra. A substantial 
fraction of the signal at the nominal mass of the single strands is probably 
comprised of the doubly charged ds-moiety. No signal corresponding to a 
non-specific trimer is present. Thus, it is clear that the addition of salt to the 
IR-MALDI sample preparation allows the detection of specific double stranded 
DNA. There is no significant difference between the spectra obtained from the 
two samples containing either NH4-acetate or Tris-HCl. 

Salt-stabilized ds-DNA is still prone to acidic denaturation. In one 
experiment, a 750 bp fragment was first stabilized by Tris-HCl resulting in a 
strong signal from the singly charged double stranded species. In contrast, only 
singly and doubly charged ions of the single strands are observed after the 
same sample was on-target acidified with succinic acid, a commonly used 
IR-MALDI matrix. A signal corresponding to the triply charged double strand was 
clearly evident. This result shows that the effect of succinic acid addition is a 
true denaturation, not a general increase in the charge state. Thus, ds-DNA can 
be stabilized and destabilized by physico-chemical means in the sample 
preparation. The sample used was actually an equimolar mixture of the 
observed 750 bp species and a fragment well above 3 kbp generated by 
restriction enzyme digest of a plasmid. The larger species could not be 
detected, because ions of this mass (>2.0 MDa) are beyond the current mass 
limit of ca. 700 kDa for IR-MALDI of large DNA. The spectra were recorded 
with delayed ion extraction in the linear TOF mode in order not to discriminate 
against ds-DNA surviving desorption and ion acceleration, but dissociating in the 
flight tube. These would not have been detected in a reflectron TOF 
configuration. The observed background signal, increasing strongly with 
decreasing mass, is typical for the glycerol matrix when measuring samples 
exceeding 100 kDa in the linear DE-TOF mode. 

These observations of a stabilization of double strands via an increased 
ionic strength were extended using restriction enzyme digested plasmid DNA 
containing fragments of 280 bp, 360 bp, 920 bp, and 1.40 kbp. Peaks at 
masses of 175 kDa, 223 kDa, and 565 kDa, corresponding to the double 
stranded species of the three smaller fragments, are observed in the spectrum. 
Peaks observed at 87.5 kDa, 112 kDa and 283 kDa nominally represent the 
single strands. Based on these results, it was shown that 

T^T'^ Si9ni,iCan " y " d0minam ' y ,0 ^ "«"*■ A verv sma„ 
signal of the monomer/doubly charged ds ion of the largest fragment of 1.40 
kbp is observed at a mass of 432 kDa; the m/z value of 864 kDa of the singly 
charged ion of the ds species is beyond the current limit of detection of 
ca. 700 kDa. 
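
The relation between fragment length, expected double stranded mass and the m/z position of each charge state can be sketched as follows. The average mass of about 618 Da per base pair is an assumption inferred from the fragment masses quoted in this Example (it is not stated in the text), the mass of the charge carriers is neglected, and the function names are hypothetical.

```python
# Sketch: expected masses and m/z positions for restriction fragments.
# AVG_DA_PER_BP is an assumed average inferred from the masses quoted in
# this Example; real values depend on sequence and counter-ions.

AVG_DA_PER_BP = 618.0          # assumed average mass of one base pair (Da)
DETECTION_LIMIT_KDA = 700.0    # approximate m/z limit stated in the text


def ds_mass_kda(base_pairs: int) -> float:
    """Approximate mass of a double stranded fragment in kDa."""
    return base_pairs * AVG_DA_PER_BP / 1000.0


def mz_kda(mass_kda: float, charge: int) -> float:
    """m/z of an ion, neglecting the mass of the charge carriers."""
    return mass_kda / charge


def within_limit(mz: float, limit_kda: float = DETECTION_LIMIT_KDA) -> bool:
    return mz <= limit_kda


if __name__ == "__main__":
    for bp in (280, 360, 920, 1400):
        mass = ds_mass_kda(bp)
        for z in (1, 2):
            mz = mz_kda(mass, z)
            flag = "detectable" if within_limit(mz) else "beyond limit"
            print(f"{bp:5d} bp  z={z}  m/z ~ {mz:6.1f} kDa  {flag}")
```

With these assumptions only the doubly charged ion of the 1.40 kbp fragment falls below the stated limit, while the singly charged ion does not. The doubly charged ds ion and the singly charged single strand share the same nominal m/z and therefore cannot be distinguished by mass alone.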

The addition of stabilizers, in particular Tris-HCl, to a glycerol sample 
preparation leads to peak broadening compared to pure 
glycerol, noticeable as an extended peak tailing to the high mass side. The 
mass resolution decreases by a factor of two, to around 25 in reflectron TOF 
mode for the mid size species, whereas the mass resolution of the 920 bp 
fragment as well as the 70-mer ds-DNA is about 50. 

An observed steep rise of the peaks of ds-ions on the low-mass side 
testifies to a very limited metastable decay, as was already observed for the 
signals of ss-species as shown here. This very low degree or absence of 
metastable fragmentation is in sharp contrast to UV-MALDI-MS analysis of 
nucleic acids, where metastable (and ion source) fragmentation is a 
factor, particularly for the analysis of larger DNA fragments. 

pe rf ormer7 entS T^™"' the °< «<- "-detection were a,so 

performed, f . s , Bnlf , cant portjon Qf ^ ^ ^ ^ 

Phas , ex formatioR of sjng , e strands formed ^ soiutjon ^ J> 

280 27 T: S : n0n " £PeCi,iC C ° mPleXeS * »■ b ~ strands o, the 

280-mer and the 360-mer would have been observed. Such non-specific 
complexes, often observed for proteins, were not observed in the DNA 

predominantly retained as such throughout the MALDI process. 

As shown herein, it is also possible to desorb intact ds-DNA from pure 
glycerol without stabilizing salts. DNA fragments over a wide mass range 
were tested for intact detection without stabilization. The condition of the DNA 
sample was very crucial for these experiments. Only freshly prepared DNA 
samples were reproducibly detected as double stranded ions. In contrast to the 
salt-stabilized samples, the laser fluence had to be carefully controlled. Near the 
threshold fluence for the detection of analyte ions (1.2 H0), a 190 bp fragment 
formed predominantly ds-ions. Increasing the fluence to 1.5 H0 resulted in the 
exclusive detection of the ss-moiety with the two single strands partially 
resolved. Hence, it is possible to denature large DNA double strands by 
adjusting the laser irradiance alone. The largest ds-DNA desorbed from pure 
glycerol was a 750 bp fragment, the smallest had 100 bp. Generally, the 
smaller the fragment, the closer to H0 the laser fluence had to be when 
desorption of the double stranded analyte was desired. The 100 bp fragment 
required strictly threshold fluence and, in addition, a low field strength for the 
ion extraction (≤ 150 V/mm). For an intact desorption of a 500 bp and a 750 
bp ds-fragment, fluences as high as 1.7 H0 and 2 H0, respectively, could be 
used. 

UV-MALDI-MS 

Attempts to analyze double stranded DNA with 60 base pairs by IR-MALDI 
were not reproducible with respect to detection of the ds-form, and 
smaller analytes did not show signals of specific ds-ions at all. Therefore, 
UV-MALDI-MS with 6-aza-2-thiothymine (ATT) matrix was used for detection of 
relatively small ds-DNA fragments. As was reported by Lecchi & Pannell 
(Lecchi et al., J. Am. Soc. Mass Spectrom. 1995, 6, 972-75), signals of the 
ds-species were obtained down to fully complementary synthetic 12-mers. In 
contrast, an equimolar mixture of a 12-mer and a fully complementary 8-mer did 
not generate any signal of the ds-ion, indicating that this is about the lower size 
limit for intact desorption of ds-DNA. It should be noted that no spectra could 
be obtained in reflectron TOF mode with the ATT matrix, probably because of 
excessive fragmentation, already noticeable in the linear TOF mode. 

The upper limit for the detection of ds-ions by UV-MALDI with an ATT 
matrix was also determined for this system. The two completely 
complementary synthetic 70-mers, investigated by IR-MALDI, produced a signal 
of the ds-ion in the ATT/UV-MALDI spectrum as well. Two barely resolved 
signals of the two individual 70-mers dominated the spectrum when 3-HPA is used 
as matrix. The resolution here was comparable to the one seen in the 
glycerol/IR-MALDI spectrum of the same sample. This experiment confirms that 
detection of larger analytes is feasible with ATT and that signals of ds-DNA 
cannot be obtained with 3-HPA, at least when prepared at room temperature. 
To prove rigorously that the signal at 43.5 kDa represented the intact ds-DNA 
rather than partially or fully unspecific homodimers, a synthetic 80-mer was 
annealed to either a complementary 70-mer (complementary over 62 consecutive 
nucleotides) or a non-complementary 70-mer, and analyzed by ATT/UV-MALDI. 
It was evident from the spectra that only the complementary 70/80-mer mixture 
produces a ds-signal, whereas the two non-complementary single strands form 
the base peaks in the spectrum. The results also showed that in spectra for 
which the ds-moiety forms the base peak, minor signals appearing at half the 
mass represent the doubly charged ds-DNA, rather than the singly charged 
individual single strands. Therefore ATT is a highly selective matrix for the 
detection of dsDNA by UV-MALDI even in the upper part of the mass range. 

The results in this Example show that MALDI-MS analysis of double 
stranded DNA is possible for analytes in a size range from 12 bp to 
920 bp. UV-MALDI-MS with ATT as matrix was used for fragments in the size 
range up to 70 bp; and IR-MALDI-MS with glycerol as matrix and salt addition 
was used for DNA molecules of 70 bp and upwards. The rather large sample 

amount (10-20 pmol) in UV-MALDI-MS with ATT as matrix limits 

the use of the method for larger analytes because this amount of sample is 
quite massive for samples of biological origin. The main reason for this reduced 
sensitivity is the extensive in-source fragmentation, apparent in all spectra 
employing the ATT matrix. Fragmentation of double stranded species was 
dominated by loss of single or multiple bases and/or backbone cleavages rather 
than dissociation into single strands. Fragmentation of the double stranded 
analytes was comparable to that observed for single stranded 
analytes. This was observed even for the double stranded 70-mer, where every 
nucleotide should be base-paired. These results indicate that the non-covalent 
ds-structure is stable under conditions that induce cleavage of covalent bonds in 
UV-MALDI-MS, provided the DNA is maintained at a minimum ionic strength. 

In contrast, very little fragmentation was observed for IR-MALDI-MS with 
glycerol. Even large DNA molecules when analyzed in reflectron TOF mode, 
having flight times longer than 1 ms, did not exhibit substantial metastable 
fragmentation. This generally applies to analytes in single and double stranded 
form. In addition, for pure glycerol as matrix, excess laser energy in 
IR-MALDI-MS leads to denaturation of non-stabilized double strands rather than 
cleavage of covalent bonds. 
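
The statement that large DNA ions have flight times longer than 1 ms can be checked with an idealized time-of-flight estimate. The sketch below assumes that the ion gains its full kinetic energy zeU before a field-free drift over the 3.5 m equivalent flight length, which is a simplification of a real reflectron instrument. The 500 kDa mass and the 20 kV potential are illustrative values within the ranges given in this Example, and the function name is hypothetical.

```python
# Sketch: idealized time of flight, t = L * sqrt(m / (2 z e U)).
# Assumes instantaneous acceleration followed by field-free drift over an
# equivalent length L; a real reflectron instrument deviates from this.
import math

ELEMENTARY_CHARGE_C = 1.602176634e-19
DALTON_KG = 1.66053906660e-27


def flight_time_s(mass_da: float, charge: int, accel_volts: float,
                  length_m: float) -> float:
    """Drift time of an ion of given mass (Da) and charge state."""
    kinetic_energy_j = charge * ELEMENTARY_CHARGE_C * accel_volts
    velocity_m_per_s = math.sqrt(2.0 * kinetic_energy_j / (mass_da * DALTON_KG))
    return length_m / velocity_m_per_s


if __name__ == "__main__":
    # Illustrative values: 500 kDa singly charged ion, 20 kV, 3.5 m.
    t = flight_time_s(500_000.0, 1, 20_000.0, 3.5)
    print(f"estimated flight time: {t * 1e3:.2f} ms")
```

Because the flight time scales with the square root of m/z, an ion of four times the mass takes twice as long to reach the detector, which is why metastable decay during such long flights would be readily noticed in reflectron mode.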

Ds-DNA is stabilized in aqueous solution by salt addition, i.e. an increase 
of the ionic strength. The stabilization results from the condensation of positive 
ions near the phosphate backbone, thereby partially neutralizing the negative 
backbone charge and reducing the repulsive electrostatic interaction between 
the two strands. The results presented here and the fact that DNA is nearly 
insoluble in glycerol suggest that the DNA retains enough of a shell of solvent 
water in the glycerol to afford the cation condensation, even after some time in 
the vacuum of the analyzer. 

A correlation between the length of the double stranded region in an 
analyte and the degree of ds-DNA in UV-MALDI-MS with ATT as matrix was 
observed. This corroborates the high specificity of UV-MALDI for the analysis of 
ds-DNA. The specificity is substantiated by the observation that a 
self-complementary RNA, forming a more stable double strand than a corresponding 
DNA, gives a higher signal of the ds-form at the 12-mer level than a DNA of 
equal length. 

The fact that UV- and IR-MALDI allow the specific analysis of ds-DNA 
indicates that these methods may also be used to study non-covalent 
complexes between DNA and proteins. The prerequisite for such investigations 
is the ability to keep the DNA in its double stranded form. IR-MALDI-MS 
demonstrated a sensitivity in the subpicomole range, making it a method of choice 
for analyses, such as for use in diagnostic methods, that require high 
sensitivity. Furthermore, as shown herein, the sensitivity can be increased by 
at least a factor of 100 by a miniaturization of the sample volume using a 
piezo-electric pipette (see, published International PCT application No. WO 98/20166 
and also copending, allowed, U.S. application Serial No. 08/787,639). The fact 
that the addition of Tris-HCl or NH4-acetate is required for a reproducible 
acquisition of ds-DNA spectra is rather a benefit. Most nucleic acid/protein 
complexes need pH- and salt-adjusted environments for stability. Interactions 
between nucleic acids, DNA triple helices and the hybridization between 
modified oligonucleotides and native DNA are further fields of application. 
EXAMPLE 4 

Small, sealed-off TEA-CO2 lasers emitting in the 10 µm wavelength range 
are commercially available and are comparable in size, price and ease of 
operation to the nitrogen lasers commonly used for UV-MALDI-MS. The 
performance data of such lasers for IR-MALDI-MS has been investigated. This 
includes the use of delayed ion extraction for an enhancement in mass 
resolution and accuracy as well as spectra for certain particular IR 
applications. 

A sealed-off TEA-CO2 laser of 10.59 µm wavelength (µ-TEA, Laser 
Science, Inc., Franklin, MA) was coupled to an in-house-built TOF instrument 
with a linear (2.2 m) and a reflectron port (3.5 m). For 
comparison experiments an Er:YAG laser (λ = 2.94 µm) or a frequency tripled 
Nd:YAG (λ = 355 nm) are available on the same instrument. 

RESULTS: Fumaric acid and glycerol were found to perform best as 
matrices for 10.6 µm wavelength. Whereas fumaric acid shows a better mass 
resolution in the mass range < 40 kDa, especially with static ion extraction, 
glycerol performs best for high masses and gives an excellent reproducibility. 
The analytical sensitivity was tested for peptides and a fumaric acid matrix. A 
sample load of only a few fmoles and a molar ana.yte-,o-matrix * ,0" were 
sufficient to generate spectra. 

The low degree of metastable decay, particularly for proteins with a 
mass > 100 kDa, described for IR-MALDI at 3 µm is observed for 10.6 µm as 
well. In combination with a reflectron mass spectrometer, this leads to a larger 
accessible mass range. For example a mass resolution of 125 was obtained for 
a mouse monoclonal antibody (IgG) at 150 kDa using a glycerol matrix in the 
reflectron mode. Homo-oligomers of the antibody up to a mass of 1.35 MDa 
were unambiguously identified in such spectra. 
Delayed ion extraction substantially improves resolution for peptides, 
similar to UV- and IR-MALDI at 3 µm. In spectra of Substance P with glycerol 
and fumaric acid matrices, mass resolutions of 4500 and 5000-7000 are 
achieved. The isotopic distributions are somewhat disturbed by intermediate and 
double peaks in the spectra, resulting in an overall inferior performance 
compared to the UV or mid-infrared laser systems on the same instrument. This 
peak "fragmentation" is tentatively assigned to the heterogeneous beam profile 
of the CO2 laser. However, the mass resolution is still sufficient to get a mass 
accuracy in the low ppm range for peptides. The mass of Substance P, for 
example, was measured with an error of only 10 mDa, using internal calibration 
with Angiotensin and Renin. 
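
To relate the absolute error of 10 mDa to the "low ppm range", the error can be divided by the ion mass. The sketch below assumes a mass of about 1347.7 Da for protonated Substance P; this value is not given in the text and is used here only for illustration. The function name is hypothetical.

```python
# Sketch: convert an absolute mass error into a relative error in ppm.
# The Substance P mass below is an assumed approximate value for the
# protonated peptide, used only for illustration.

SUBSTANCE_P_MH_DA = 1347.7  # assumed approximate [M+H]+ mass


def ppm_error(abs_error_da: float, mass_da: float) -> float:
    """Relative mass error in parts per million."""
    return abs_error_da / mass_da * 1e6


if __name__ == "__main__":
    error_ppm = ppm_error(0.010, SUBSTANCE_P_MH_DA)
    print(f"10 mDa at {SUBSTANCE_P_MH_DA} Da is about {error_ppm:.1f} ppm")
```

Under this assumption the 10 mDa error corresponds to a relative error below 10 ppm, consistent with the accuracy range stated above.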

A study was conducted of IR-CO2-MALDI-MS of myoglobin (horse) 
electroblotted onto a polymer membrane (Immobilon P) after gel separation. A 
succinic acid matrix was used. A comparison of the spectra quality with that of 
spectra obtained from such membranes with the Er:YAG laser at 3 µm indicates 
that the latter may be preferable. 

The analysis of double stranded DNA up to a mass of more than 300 
kDa using a glycerol matrix was also conducted for a 515-bp PCR product. 
IR-MALDI-MS at 3 and 10.6 µm wavelength has very similar features; use of the 
3 µm wavelength may be preferred in certain applications. 
EXAMPLE 5 

IR-MALDI of large nucleic acids 

MALDI-MS of proteins above ~20 kDa with lasers emitting in the 3 µm 
wavelength region induces significantly less fragmentation of desorbed ions 
than UV-MALDI, particularly if glycerol as (liquid) matrix is used. Tests of lasers 
emitting at different wavelengths and various matrices for analysis of large 
nucleic acids have been conducted. It was found that the Er-YAG laser 
(2.94 µm) with a glycerol matrix is a gentle combination for the intact 
desorption/ionization of nucleic acids. The experiments were carried out with a 
single-stage reflectron, time of flight (refTOF) mass spectrometer with a split ion 
extraction source of 16 kV acceleration potential operated with either prompt or 
delayed ion extraction [S. Berkenkamp, C. Menzel, M. Karas and F. Hillenkamp, 
Rapid Commun. Mass Spectrom., 11, 1399, (1997)]. An Er:YAG laser 
(λ = 2.94 µm, τ = 85 ns, Spektrum GmbH, Berlin, Germany) was used for 
desorption. Prior to sample preparation, the glycerol matrix was incubated with 
an equal volume of a H+-cation exchange bead suspension. 

As demonstrated for a plasmid DNA restriction enzyme digest (mixture of 
1045 bp and 1913 bp fragments, 322 kDa and 592 kDa), IR-MALDI-MS with 
glycerol matrix can be used for oligomers of around 2000 nt. For all large DNA 
measured, the mass resolution of ion signal (FWHM) was about 50 and 
appeared to be relatively independent of the ion mass. IR-MALDI-MS of DNA 
could be used to measure masses of approximately 700 kDa. Large RNA can 
also be analyzed by IR-MALDI-MS as demonstrated for a 1206 nt transcript (ca. 
388 kDa), synthesized in vitro. Up to this mass RNA and DNA exhibit 
comparable stability. For all measured samples of high mass DNA and RNA the 
mass accuracy was between 0.5% and 1% of that calculated from the 
sequence. Even the mass of a 2180 nt fragment was determined with a 0.6% 
accuracy. Signals of IgG monoclonal antibodies of well defined mass and of 
their oligomers have been used for the mass calibration. The sensitivity of 
IR-MALDI-MS of large nucleic acids with glycerol as matrix, evaluated for a 
PCR product of approximately 515 bp, was found to be in the low femtomole range. 
Spectra with reasonable quality could even be obtained from 500 amol total load 
of sample. All results reported above have been obtained with only limited 
efforts in a sample purification. For the restriction enzyme digest fragment a 
one step purification (precipitation) appeared to be sufficient. For the RNA, 
additionally a reverse phase column was prepared. 
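
The practical meaning of a mass resolution of about 50 and a mass accuracy of 0.5% to 1% for such large ions can be illustrated numerically. The sketch below converts the stated resolution into a peak width and the stated accuracy into a mass window. The separation criterion (peak centres further apart than the sum of both widths) is a crude assumption made here for illustration, and the function names are hypothetical.

```python
# Sketch: peak width and accuracy window implied by the stated figures.
# The separation criterion is a crude illustrative assumption.


def peak_fwhm_kda(mass_kda: float, resolution: float = 50.0) -> float:
    """Peak width (FWHM) for a given mass resolution m / delta_m."""
    return mass_kda / resolution


def accuracy_window_kda(mass_kda: float, relative_accuracy: float):
    """Lower and upper mass bounds for a given relative accuracy."""
    return (mass_kda * (1.0 - relative_accuracy),
            mass_kda * (1.0 + relative_accuracy))


def clearly_separated(mass_a_kda: float, mass_b_kda: float,
                      resolution: float = 50.0) -> bool:
    """True when the peak centres differ by more than the sum of both FWHMs."""
    widths = peak_fwhm_kda(mass_a_kda, resolution) + peak_fwhm_kda(mass_b_kda, resolution)
    return abs(mass_a_kda - mass_b_kda) > widths


if __name__ == "__main__":
    for mass in (322.0, 592.0):
        low, high = accuracy_window_kda(mass, 0.01)
        print(f"{mass:.0f} kDa: FWHM ~ {peak_fwhm_kda(mass):.1f} kDa, "
              f"1% window {low:.1f}-{high:.1f} kDa")
    print("322 and 592 kDa separated:", clearly_separated(322.0, 592.0))
```

At this resolution the two digest fragments quoted above are far apart relative to their peak widths, whereas fragments differing by only a few percent in mass would overlap.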

Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended 
claims. 
WHAT IS CLAIMED IS: 

1. A process for performing matrix assisted laser 
desorption/ionization (MALDI) of a nucleic acid molecule for analysis by mass 
spectrometry, comprising the steps of: 

(a) depositing a solution containing the nucleic acid and a liquid matrix 
on a substrate, thereby forming a homogeneous, thin layer of a nucleic 
acid/liquid matrix solution; and 

(b) illuminating the substrate with infrared radiation, whereby the nucleic 
acid in the solution is desorbed and ionized. 

2. The process of claim 1, further comprising determining the mass 
of the nucleic acid by MALDI. 

3. The process of claim 1, further comprising determining the mass 
of the nucleic acid by MALDI and thereby detecting the presence 
of the nucleic acid in a sample. 

4. A process of any of claims 1-3, wherein the liquid matrix has at 
least one of the following properties: i) is miscible with a nucleic acid 
compatible solvent, ii) is vacuum stable, and iii) is of an appropriate viscosity 
for dispensing the thin layer of micro- to nano-liter volumes of matrix alone or 
mixed with a nucleic acid compatible solvent. 

5. The process of any of claims 1-3, wherein the liquid matrix is 
sufficiently non-volatile to not evaporate during the illuminating, desorbing and 
ionizing step. 

6. The process of claim 1, wherein the liquid matrix can form a 
glass when cooled and/or pressurized. 

7. The process of claim 1, wherein the matrix comprises a sugar, a 
monosaccharide, or a polysaccharide. 

8. The process of claim 1, wherein the matrix comprises a polyglycerol, 
sucrose, mannose, galactose, ethylene glycol, propylene glycol, 
trimethylolpropane, pentaerythritol, dextrose, methylglycoside or sorbitol, 
sucrose, mannose, triethanolamine, lactic acid, 3-nitrobenzylalcohol, 
diethanolamine, DMSO, nitrophenyloctylether (3-NPOE), 2,2'-dithiodiethanol, 
tetraethyleneglycol, dithiothreitol/erythritol (DTT/DTE), 
2,3-dihydroxy-propyl-benzyl ether, α-tocopherol, and thioglycerol. 

9. A process of claim 1, wherein the liquid matrix contains at least 
one functional group that absorbs infrared radiation. 

10. A process of claim 9, wherein the functional group is selected 
from the group comprising: nitro, sulfonyl, sulfonic acid, sulfonamide, nitrile, 
carbonyl, aldehyde, carboxylic acid, amide, ester, anhydride, ketone, amine, 
hydroxyl, an aromatic ring and a diene. 

11. A process of claim 1, wherein the liquid matrix is selected from a 
group comprising: an alcohol, a carboxylic acid, a primary or secondary amide, 
a primary or secondary amine, a nitrile, hydrazine and hydrazide. 

12. A process of claim 11, wherein the alcohol is selected from the 
group comprising: glycerol, 1,2- or 1,3-propane diol, 1,2-, 1,3- or 1,4-butane 
diol and triethanolamine. 

13. A process of claim 11, wherein the carboxylic acid is selected 
from the group comprising: lactic acid, acetic acid, formic acid, propionic acid, 
butanoic acid, pentanoic acid, hexanoic acid and esters thereof. 

14. A process of claim 11, wherein the amide is selected from the 
group comprising: acetamide, propanamide, butanamide, pentanamide and 
hexanamide, whether branched or unbranched. 

15. A process of claim 11, wherein the amine is selected from the 
group comprising: propylamine, butylamine, pentylamine, hexylamine, 
heptylamine, diethylamine and dipropylamine. 

16. A process of claim 4, wherein the liquid matrix is comprised of at 
least two liquids, each of which confers at least one of the properties. 

17. A process of claim 1, wherein the liquid matrix comprises an 
additive. 

18. A process of claim 17, wherein the additive is selected from the 
group comprising: a compound having a high extinction coefficient at the laser 
wavelength used for the analysis, an additive that acidifies the liquid matrix, an 
additive that minimizes salt formation between the liquid matrix and the 
phosphate backbone of the nucleic acid. 

19. A process of claim 17, wherein the additive increases the ionic 
strength of the matrix composition. 

20. A process of claim 10, wherein prior to step (a), the liquid matrix 
is treated to minimize salt formation between the matrix and the phosphate 
backbone of the nucleic acid. 

21. A process of claim 1, wherein the liquid matrix is treated by 
distillation or ion exchange. 

22. A process of claim 1, wherein the liquid matrix is treated by 
further purification. 

23. A process of claim 1, wherein the liquid matrix is selected from 
the group consisting of glycerol, lactic acid or triethanolamine. 

24. A process of claim 23, wherein the liquid matrix is glycerol and 
the final analyte-to-glycerol molar ratio is about 10⁻⁴ to about 10⁻⁹. 

25. A process of claim 1, wherein the liquid matrix is glycerol, the 
mass of the nucleic acid is in the range of from about 10⁴ to about 10⁶ Da and 
the glycerol is subjected to ion exchange prior to step (a). 

26. A process of claim 1, wherein the nucleic acid is DNA. 

27. A process of claim 26, wherein the DNA comprises less than 
or equal to about 2000 bases. 

28. A process of claim 1, wherein the nucleic acid is RNA. 

29. A process of claim 28, wherein the RNA comprises less than 
or equal to about 1200 bases. 

30. A process of claim 1, wherein the nucleic acid comprises PNA. 

31. A process of claim 1, wherein the nucleic acid comprises 
double-stranded nucleic acid. 

32. A process of claim 1, wherein the nucleic acid comprises 
single-stranded nucleic acid. 

33. A process of claim 1, wherein the infrared radiation is of a 
wavelength in the range of from about 2.5 µm to about 12 µm. 

34. A process of claim 1, wherein the radiation pulses have a width 
in the range of about 500 ps to about 500 ns. 

35. A process of claim 1, wherein the pulse duration is less than 
200 ns. 

36. A process of claim 35, wherein the pulse duration is less than 
100 ns. 

37. A process of claim 1, wherein the infrared radiation is generated 
from a source selected from the group comprising: a CO laser, a CO2 laser, an 
Er laser and an optical parametric oscillator laser emitting in the range of about 
2.5 to about 12 µm. 

38. A process of claim 1, wherein the sample contains less than 
about 10 pmoles of nucleic acid. 

39. A process of claim 1, wherein all or a portion of the process is 
automated. 

40. A process of claim 1, wherein the sample is cooled to a 
temperature which is below about 20°C. 

41. A process of claim 1, wherein the sample is heated to a 
temperature which is greater than about 20°C and less than about 80°C. 

42. A process of claim 1, wherein the matrix and sample mixture are 
cooled, whereby the matrix forms a glass. 

43. A process of claim 1, wherein the glass is glassy water. 

44. A process of claim 1, wherein the matrix comprises glycerol and 
the glycerol and sample mixture are cooled, whereby the glycerol freezes. 

45. A method of claim 1, wherein the nucleic acid ions are extracted 
from the ion source by delayed extraction. 

46. A process for analyzing a nucleic acid by mass spectrometry, 
comprising the steps of: 

(a) depositing a mixture containing the nucleic acid and a liquid matrix 
on a substrate, thereby forming a homogeneous, thin layer of a nucleic 
acid/liquid matrix mixture; 

(b) illuminating the substrate of (a) with an infrared laser, so that the 
nucleic acid is desorbed and ionized; and 

(c) mass separating and detecting the ionized nucleic acid using a mass 
separation and analysis format. 

47. The process of claim 46, wherein the liquid matrix is sufficiently 
non-volatile to not evaporate during the illuminating, desorbing and ionizing 
step. 

48. The process of claim 46, wherein the liquid matrix can form a 
glass when cooled and/or pressurized. 

49. The process of claim 46, wherein the matrix comprises a sugar, a 
monosaccharide, or a polysaccharide. 

50. The process of claim 46, wherein the matrix comprises a 
polyglycerol, sucrose, mannose, galactose, ethylene glycol, propylene glycol, 
trimethylolpropane, pentaerythritol, dextrose, methylglycoside or sorbitol, 
sucrose, mannose, triethanolamine, lactic acid, 3-nitrobenzylalcohol, 
diethanolamine, DMSO, nitrophenyloctylether (3-NPOE), 2,2'-dithiodiethanol, 
tetraethyleneglycol, dithiothreitol/erythritol (DTT/DTE), 
2,3-dihydroxy-propyl-benzyl ether, α-tocopherol, and thioglycerol. 

51. A process of claim 46, wherein the liquid matrix has at least one 
of the following properties: i) is miscible with a nucleic acid compatible solvent, 
ii) is vacuum stable, and iii) is of an appropriate viscosity to facilitate 
dispensing of micro- to nano-liter volumes of matrix alone or mixed with a 
nucleic acid compatible solvent. 

52. A process of claim 46, wherein the liquid matrix contains at least 
one functional group that strongly absorbs infrared radiation. 

53. A process of claim 52, wherein the functional group is selected 
from the group comprising: nitro, sulfonyl, sulfonic acid, sulfonamide, nitrile, 
carbonyl, aldehyde, carboxylic acid, amide, ester, anhydride, ketone, amine, 
hydroxyl, an aromatic ring and a diene. 

54. A process of claim 46, wherein the liquid matrix is selected from 
a group comprising: an alcohol, a carboxylic acid, a primary or secondary 
amide, a primary or secondary amine, a nitrile, hydrazine and hydrazide. 

55. A process of claim 54, wherein the alcohol is selected from the 
group comprising: glycerol, 1,2- or 1,3-propane diol, 1,2-, 1,3- or 1,4-butane 
diol and triethanolamine. 

56. A process of claim 54, wherein the carboxylic acid is selected 
from the group comprising: lactic acid, acetic acid, formic acid, propionic acid, 
butanoic acid, pentanoic acid, hexanoic acid and esters thereof. 

57. A process of claim 54, wherein the amide is selected from the 
5 group comprising: acetamide, propanamide, butanamide, pentanamide and 

hexanamide, whether branched or unbranched. 

58. A process of claim 54, wherein the amine is selected from the 
group comprising: propylamine, butylamine, pentylamine, hexylamine 
heptylamine, diethylamine and dipropylamine 

1° 59. A process of claim 51 , wharein the liquid matrix is comprised of 

at .east two liquids, each of which confers at least one of the properties 

60. A process of claim 46, wherein the liquid matrix comprises an 
additive. 

61 . A process of claim 60, wherein the additive is selected from the 
group comprising: a compound having a high extinction coefficient at the laser 
wavelength used for the analysis, an additive that acidifies the liquid matrix an 
addmve that minimizes sa.t formation between the liquid matrix and the 
phosphate backbone of the nucleic acid. 

62. A process of claim 60, wherein the additive increases the ionic
strength of the matrix composition.

63. A process of claim 46, wherein prior to step (a), the liquid matrix
is treated to minimize salt formation between the matrix and the phosphate
backbone of the nucleic acid.

64. A process of claim 46, wherein the liquid matrix is treated by
distillation or ion exchange.

65. A process of claim 46, wherein the liquid matrix is treated by 
further purification. 

66. A process of claim 46, wherein the liquid matrix is selected from
the group consisting of glycerol, lactic acid or triethanolamine.

67. A process of claim 43, wherein the liquid matrix is glycerol and
the final analyte-to-glycerol molar ratio is about 10- to about 10*

68. A process of claim 46, wherein the liquid matrix is glycerol, the
mass of the nucleic acid is in the range of from about 10^4 to about 10^6 Da and
the glycerol is subjected to ion exchange prior to step (a).

69. A process of claim 46, wherein the nucleic acid is DNA. 

70. A process of claim 46, wherein the DNA comprises less than
or equal to about 2000 bases.

71. A process of claim 46, wherein the nucleic acid is RNA.

72. A process of claim 71, wherein the RNA comprises less than
or equal to about 1200 bases.

73. A process of claim 46, wherein the nucleic acid comprises PNA.

74. A process of claim 46, wherein the nucleic acid comprises 
double-stranded nucleic acid. 

75. A process of claim 46, wherein the nucleic acid comprises single- 
stranded nucleic acid. 

76. A process of claim 46, wherein the infrared radiation is of a
wavelength in the range of from about 2.5 µm to about 12 µm.

77. A process of claim 46, wherein the radiation pulses have a width 
in the range of about 500ps to about 500ns. 

78. A process of claim 46, wherein the infrared radiation is generated
from a source selected from the group comprising: a CO laser, a CO2 laser, an
Er laser and an optical parametric oscillator laser emitting in the range of about
2.5 to about 12 µm.

79. A process of claim 46, wherein the sample contains less than 
about 10 pmoles of nucleic acid. 

80. A process of claim 46, wherein all or a portion of the process is
automated.

81. A process of claim 46, wherein the sample is cooled to a
temperature that is below about 20 °C.

82. A process of claim 46, wherein the sample is heated to a
temperature which is greater than about 20 °C and less than about 80 °C.

83. A process of claim 46, wherein the matrix and sample mixture 
are cooled, whereby the matrix forms a glass.

84. A process of claim 46, wherein the glass is glassy water.

85. A process of claim 46, wherein the matrix comprises glycerol and 
the glycerol and sample mixture are cooled, whereby the glycerol freezes. 

86. A method of claim 46, wherein prior to step (c), the nucleic acid
ions are extracted from the ion source by delayed extraction.

87. A method of claim 46, wherein the mass separation and analysis 
format is selected from the group consisting of: time-of-flight (TOF), 
quadrupole, magnetic sector, Fourier transform ion cyclotron resonance
(FTICR), ion trap or a combination thereof. 

88. A method of claim 87, wherein the TOF is linear, or the TOF has
a reflector.

89. A method of claim 87, wherein the TOF reflector has a linear 
field or a nonlinear field. 

90. A method of claim 87, wherein the quadrupole is single or the
quadrupole is multiple.

91. A method of claim 87, wherein the magnetic sector is single or
the magnetic sector is multiple. 

92. A method for determining the size of a primer extension product, 
comprising: 

(a) hybridizing a primer with a target nucleic acid, where the primer (i) is
complementary to the target nucleic acid; (ii) has a first region containing the 5'
end of the primer and an immobilization attachment site, and (iii) has a second
region containing the 3' end of the primer, where the 3' end is capable of
serving as a priming site for enzymatic extension and where the second region
contains a selected cleavable site;

(b) extending the primer enzymatically to generate a polynucleotide 
mixture containing an extension product composed of the primer and an 
extension segment; 

(c) cleaving the extension product at the cleavable site to release the
extension segment, where prior to the cleaving the primer is immobilized at the
immobilization attachment site; and

(d) sizing the extension segment by the method of claim 1, whereby the
cleaving is effective to increase the read length of the extension segment 
relative to the read length of the product of (b). 

93. A method for determining the size of a primer extension product,
comprising:

(a) combining first and second primers with a target nucleic acid under 
conditions that promote the hybridization of the primers to the nucleic acid, 
thus generating primer/nucleic acid complexes, where the first primer (i) is 
complementary to the target nucleic acid; (ii) has a first region containing the 5' 
end of the primer and an immobilization attachment site, and (iii) has a second
region containing the 3' end of the primer, where the 3' end is capable of 
serving as a priming site for enzymatic extension and where the second region 
contains a cleavable site, and where the second primer is homologous to the 
target nucleic acid; 

(b) converting the primer/nucleic acid complexes to double-stranded
fragments in the presence of a suitable polymerase and all four dNTPs;

(c) amplifying the primer-containing fragments by successively repeating
the steps of (i) denaturing the double-stranded fragments to produce
single-strand fragments, (ii) hybridizing the single strands with the primers to
form strand/primer complexes, (iii) generating double-stranded fragments from
the strand/primer complexes in the presence of DNA polymerase and all four
dNTPs, and (iv) repeating steps (i) to (iii) until a desired degree of amplification
has been achieved;

(d) denaturing the amplified fragments to generate a mixture including a
product composed of the first primer and an extension segment;

(e) immobilizing amplified fragments containing the first primer, utilizing 
the immobilization attachment site, and removing non-immobilized amplified 
fragments; 

(f) cleaving the immobilized fragments at the cleavable site to release
the extension segment; and

(g) sizing the extension segment by the method of claim 1, whereby the
cleaving is effective to increase the read length of the extension segment 
relative to the read length of the product of (d). 

94. A method for determining the DNA sequence of a target DNA
sequence, comprising:

(a) hybridizing a primer with a target DNA, where the primer (i) is
complementary to the target DNA; (ii) has a first region containing the 5' end
of the primer and an immobilization attachment site, and (iii) has a second
region containing the 3' end of the primer, where the 3' end is capable of
serving as a priming site for enzymatic extension and where the second region
contains a cleavable site;

(b) extending the primer with an enzyme in the presence of a first of
four different dideoxy nucleotides to generate a mixture of primer extension
products, each product containing a primer and an extension segment;

(c) cleaving at the cleavable site to release the extension segments,
where prior to the cleaving the primers are immobilized at the immobilization
attachment sites;

(d) sizing the extension segments by the method of claim 1, whereby
the cleaving is effective to increase the read length of the extension segment
relative to the read length of the product of (b);

(e) repeating steps (a) through (d) with a second, third, and fourth of the
four different dideoxy nucleotides; and

(f) determining the DNA sequence of the target DNA by comparison of
the sizes of the extension segments obtained from each of the four extension
reactions.
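
The comparison in step (f) can be illustrated with a minimal sketch. It assumes, purely for illustration, that each measured mass has already been converted to an extension-segment length in nucleotides; the function name `read_sequence` and the input layout are hypothetical and are not taken from the claims.

```python
from typing import Dict, Iterable


def read_sequence(sizes_by_base: Dict[str, Iterable[int]]) -> str:
    """Merge segment sizes from four termination reactions into one read.

    sizes_by_base maps each terminating base ("A", "C", "G", "T") to the
    extension-segment lengths (in nucleotides) seen in that reaction.
    This input format is an assumption made for the sketch.
    """
    calls = {}
    for base, sizes in sizes_by_base.items():
        for size in sizes:
            if size in calls:
                raise ValueError(f"size {size} observed in more than one reaction")
            calls[size] = base
    # Shorter segments terminate closer to the primer, so ascending size
    # gives the 5'-to-3' order of the newly synthesized strand.
    return "".join(calls[size] for size in sorted(calls))


example = {"A": [1, 5], "C": [2], "G": [3, 6], "T": [4]}
print(read_sequence(example))  # ACGTAG
```

The read is that of the extended strand, which is complementary to the target DNA.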

95. A method for determining the DNA sequence of a target DNA 
sequence, comprising: 

(a) hybridizing a primer with a target DNA, where the primer (i) is 
complementary to the target DNA; (ii) has a first region containing the 5' end
of the primer and an immobilization attachment site, and (iii) has a second
region containing the 3' end of the primer, where the 3' end is capable of
serving as a priming site for enzymatic extension and where the second region
contains a cleavable site; 

(b) extending the primer with an enzyme in the presence of a first of
four different deoxynucleoside α-thiotriphosphate analogs (dNTPαS) to generate
a mixture of primer extension products containing phosphorothioate linkages;

(c) treating the primer extension products with a reagent that cleaves 
specifically at the phosphorothioate linkages, where the treating is carried out 
under conditions producing limited cleavage, resulting in the production of a 
group of primer extension degradation products; 

(d) washing the primer extension degradation products, where prior to
the washing, the primer extension degradation products are immobilized at the
immobilization attachment sites, each immobilized primer extension degradation
product containing a primer and an extension segment, where the washing is
effective to remove non-immobilized species;

(e) cleaving at the cleavable site to release the extension segments;

(f) sizing the extension segments by the method of claim 1, whereby the
cleaving is effective to increase the read length of any given extension segment 
relative to the read length of its corresponding primer extension degradation 
product; 

(g) repeating steps (a) through (f) with a second, third, and fourth of the
four different dNTPαSs; and

(h) determining the DNA sequence of the target DNA by comparison of 
the sizes of the extension segments obtained from each of the four extension 
reactions. 

96. A method for determining the size of a primer extension product,
comprising:

(a) hybridizing a primer with a target nucleic acid, where the primer (i) is 
complementary to the target nucleic acid; (ii) has a first region containing the 5' 
end of the primer, and (iii) has a second region containing the 3' end of the 
primer, where the 3' end is capable of serving as a priming site for enzymatic
extension and where the second region contains a selected cleavable site;

(b) extending the primer enzymatically to generate a polynucleotide
mixture containing an extension product composed of the primer and an
extension segment;

(c) cleaving the extension product at the cleavable site to release the
extension segment; and

(d) sizing the extension segment by the method of claim 1, whereby the
cleaving is effective to increase the read length of the extension segment
relative to the read length of the product of (b).

97. A method for determining the size of a primer extension product,
comprising:

(a) hybridizing a primer with a target nucleic acid, where the primer (i) is
complementary to the target nucleic acid; (ii) has a first region containing the 5'
end of the primer, and an immobilization attachment site, where the
immobilization attachment site of the primer is composed of a series of bases
complementary to an intermediary oligonucleotide, and (iii) has a second region
containing the 3' end of the primer, where the 3' end is capable of serving as a
priming site for enzymatic extension and where the second region contains a
selected cleavable site;

(b) extending the primer enzymatically to generate a polynucleotide
mixture containing an extension product composed of the primer and an
extension segment;

(c) cleaving the extension product at the cleavable site to release the
extension segment, where prior to the cleaving the primer is immobilized by
specific hybridization of the immobilization attachment site to the intermediary
oligonucleotide bound to a solid support; and

(d) sizing the extension segment by the method of claim 1, whereby the
cleaving is effective to increase the read length of the extension segment
relative to the read length of the product of (b).

98. A method for determining the size of a primer extension product,
comprising: 

(a) combining first and second primers with a target nucleic acid, under 
conditions that promote hybridization of the primers to the nucleic acid,
generating primer/nucleic acid complexes, where the first primer (i) has a 5'
end and a 3' end, (ii) is complementary to the target nucleic acid, (iii) has a first
region containing the 5' end of the first primer and (iv) has a second region
containing the 3' end of the first primer, where the 3' end is capable of serving
as a priming site for enzymatic extension and where the second region contains
a cleavable site, and where the second primer (i) has a 5' end and a 3' end, (ii)
is homologous to the target nucleic acid, (iii) has a first segment containing the
3' end of the second primer, and (iv) has a second segment containing the 5'
end of the second primer and an immobilization attachment site;

(b) converting the primer/nucleic acid complexes to double-stranded
fragments in the presence of a DNA polymerase and deoxynucleoside
triphosphates;

(c) amplifying the primer-containing fragments by successively repeating
the steps of (i) denaturing the double-stranded fragments to produce
single-stranded fragments, (ii) hybridizing the single-stranded fragments with
the first and second primers to form strand/primer complexes, (iii) generating
amplification products from the strand/primer complexes in the presence of
DNA polymerase and deoxynucleoside triphosphates, and (iv) repeating steps (i)
to (iii) until a desired degree of amplification has been achieved;

(d) immobilizing amplification products containing the second primer via
the immobilization attachment site;

(e) removing non-immobilized amplified fragments; 

(f) cleaving the immobilized amplification products at the cleavable site, 
to generate a mixture including a double-stranded product; 

(g) denaturing the double-stranded product to release the extension
segment; and

(h) sizing the extension segment by the method of claim 1, whereby the
cleaving is effective to increase the read length of the extension segment
relative to the read length of the amplified strand-primer complexes of (c).

99. A method for determining a single base fingerprint of a target
DNA sequence, comprising:

(a) hybridizing a primer with a target DNA, where the primer (i) is
complementary to the target DNA; (ii) has a first region containing the 5' end
of the primer and an immobilization attachment site, and (iii) has a second
region containing the 3' end of the primer, where the 3' end is capable of
serving as a priming site for enzymatic extension and where the second region
contains a selected cleavable site;

(b) extending the primer with an enzyme in the presence of a
dideoxynucleoside triphosphate corresponding to the single base, to generate a
polynucleotide mixture of primer extension products, each product containing a
primer and an extension segment;

(c) cleaving the extension products at the cleavable site to release the
extension segments, where prior to the cleaving the primers are immobilized at
the immobilization attachment sites;

(d) sizing the extension segments by the method of claim 1, whereby
the cleaving is effective to increase the read length of any given extension
segment relative to the read length of its corresponding primer extension
product of (b); and

(e) determining the positions of the single base in the target DNA by
comparison of the sizes of the extension segments.

100. A method for determining an adenine fingerprint of a target DNA
sequence, comprising:

(a) hybridizing a primer with a DNA target, where the primer (i) is
complementary to the target DNA; (ii) has a first region containing the 5' end
of the primer and an immobilization attachment site, and (iii) has a second
region containing the 3' end of the primer, where the 3' end is capable of
serving as a priming site for enzymatic extension and where the second region
contains a selected cleavable site;

(b) extending the primer with an enzyme in the presence of
deoxyadenosine triphosphate (dATP), deoxythymidine triphosphate (dTTP),
deoxycytidine triphosphate (dCTP), deoxyguanosine triphosphate (dGTP), and
deoxyuridine triphosphate (dUTP), to generate a polynucleotide mixture of
primer extension products containing dUTP at positions corresponding to dATP
in the target, each product containing a primer and an extension segment;

(c) treating the primer extension products with uracil DNA-glycosylase 
to fragment specifically at dUTP positions to produce a set of primer extension 
degradation products; 

(d) washing the primer extension degradation products, where prior to
the washing, the primer extension degradation products are immobilized at the
immobilization attachment sites, each immobilized primer extension degradation
product containing a primer and an extension segment, where the washing is
effective to remove non-immobilized species;

(e) cleaving the immobilized primer extension degradation products at
the cleavable site to release the extension segments;

(f) sizing the extension segments by the method of claim 1, whereby
the cleaving is effective to increase the read length of any given extension
segment relative to the read length of its corresponding primer extension
degradation product; and

(g) determining the positions of adenine in the target DNA by 
comparison of the sizes of the released extension segments. 

101. A method of detecting mutations in a target nucleic acid, 
comprising: 

a) obtaining from the target nucleic acid a set of nonrandom length
fragments (NLFs) in single-stranded form, wherein the set comprises NLFs
derived from one of either the positive or the negative strand of the target
nucleic acid or the set is a subset of single-stranded NLFs derived from the
positive and the negative strand of the target nucleic acid; and

b) determining masses of the members of the set by the method of
claim 1.

102. A method of detecting mutations in a target nucleic acid,
comprising: 

a) nonrandomly fragmenting the target nucleic acid with one or more
restriction endonucleases to form a set of double-stranded NLFs, wherein the
nonrandomly fragmenting further comprises using volatile salts in a restriction
buffer; and

b) determining masses of the members of the set of double-stranded
NLFs by the method of claim 1.

103. A method of detecting mutations in a double-stranded target
nucleic acid, comprising:

a) nonrandomly fragmenting the target nucleic acid using one or more
restriction endonucleases to form a first set of nonrandom length fragments
(NLFs);

b) hybridizing members of the first set of NLFs to a set of wild type
probes;

c) nonrandomly fragmenting one or more members of the set of NLFs
with one or more mutation-specific cleaving reagents that specifically cleave at
any regions of nucleotide mismatch that form between members of the first set
of NLFs and complementary members of the set of wild type probes, wherein
the nonrandomly fragmenting step forms a second set of NLFs; and

d) determining masses of members of the second set of NLFs by the
method of claim 1.

104. A method of detecting mutations in a target nucleic acid, 
comprising: 

a) nonrandomly fragmenting the target nucleic acid, using a mixture 
comprising one or more volatile salts to form a set of nonrandom length
fragments (NLFs); and

b) determining masses of members of the set of NLFs by the method of
claim 1.

105. The method of claim 1 further comprising washing the nucleic
acid sample with a mixture of volatile salts, and evaporating the mixture of
volatile salts from the sample, thereby decreasing background noise.

106. A method for analyzing DNA tandem nucleotide repeat alleles at
a DNA tandem nucleotide repeat locus in a target nucleic acid by mass 
spectrometry, the method comprising: 

a) obtaining a target nucleic acid comprising a DNA tandem nucleotide 
repeat region;

b) extending the target nucleic acid using one or more primers to obtain 
a limited size range of nucleic acid extension products, wherein one or more 
primers are complementary to a sequence flanking the DNA tandem nucleotide 
repeat of the locus; and 

c) determining the mass of the nucleic acid extension products by the
method of claim 1.
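
As a hedged illustration of how an allele might be read from a sized extension product, the sketch below assumes the product length in nucleotides has already been derived from the measured mass, and that the length of the non-repeat (primer and flanking) portion is known. The function `repeat_count` and its parameters are hypothetical and are not defined by the claims.

```python
def repeat_count(product_length: int, flank_length: int, unit_length: int) -> int:
    """Return the number of tandem repeat units in an extension product.

    product_length: total product length in nucleotides (assumed known).
    flank_length:   nucleotides contributed by primer and flanking sequence.
    unit_length:    length of one repeat unit, e.g. 4 for a tetranucleotide.
    """
    if unit_length <= 0:
        raise ValueError("unit_length must be positive")
    repeat_region = product_length - flank_length
    if repeat_region < 0 or repeat_region % unit_length != 0:
        raise ValueError("length is not consistent with a whole number of repeats")
    return repeat_region // unit_length


print(repeat_count(68, 40, 4))  # 7
```

Keeping the flanking portion short narrows the size range of the products, which is the point of step b).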

107. A method for multiplexing the identification of more than one 
DNA tandem nucleotide repeat regions from more than one DNA tandem 
nucleotide repeat loci by mass spectrometry, which method comprises: 

a) obtaining more than one nucleic acid extension products by extending
one or more primers complementary to sequences flanking the DNA tandem
nucleotide repeat regions; and

b) determining the mass of the more than one nucleic acid extension
products simultaneously by the method of claim 1, wherein the nucleic acid
extension products have overlapping allelic mass ranges.

108. A method for multiplexing the identification of more than one 
DNA tandem nucleotide repeat regions for more than one DNA tandem 
nucleotide repeat loci, which method comprises: 

a) obtaining more than one nucleic acid amplification products by
amplifying two or more primers complementary to sequences flanking the DNA
tandem nucleotide repeat regions; and

b) determining the masses of more than one nucleic acid amplification
products simultaneously by the method of claim 1, wherein the nucleic acid
extension products have overlapping allelic mass ranges.

109. A method for analyzing DNA tandem nucleotide repeat alleles at
a DNA tandem nucleotide repeat locus in a target nucleic acid by mass
spectrometry, the method comprising:

a) obtaining a target nucleic acid comprising a DNA tandem nucleotide
repeat region;

b) extending the target nucleic acid using one or more primers to
obtain a limited size range of nucleic acid extension products, wherein one or
more primers are complementary to a sequence flanking the DNA tandem
nucleotide repeat of the locus; and

c) determining the mass of the nucleic acid extension products by the
method of claim 1.

110. The method of claim 109, wherein a 3' end of one or more
primers immediately flanks a DNA tandem nucleotide repeat region.

111. The method of claim 109, wherein one or more primers comprise
a sequence complementary to up to one tandem repeat of the DNA tandem
nucleotide repeat locus.

112. The method of claim 111, wherein one or more primers comprise
a sequence complementary to up to two tandem repeats of the DNA tandem
nucleotide repeat locus.

113. The method of claim 112, wherein one or more primers comprise
a sequence complementary to up to three tandem repeats of the DNA tandem
nucleotide repeat locus.

114. The method of claim 109, wherein at least one primer comprises
a cleavable site.

115. The method of claim 114, wherein the cleavable site comprises a
recognition site for a restriction endonuclease, an exonuclease blocking site, or 
a chemically cleavable site. 

116. The method of claim 114, wherein at least one primer is
capable of attaching to a solid support.

117. The method of claim 116, wherein at least one primer comprises
biotin or digoxigenin.

118. The method of claim 109, wherein the extension of at least one
primer is terminated using a chain termination reagent.

119. The method of claim 109, wherein the chain termination reagent
is a dideoxynucleotide triphosphate.

120. A method for detecting a target molecule, comprising:

(a) obtaining a target molecule; 

(b) amplifying the target molecule to produce an amplified target 
molecule; 

(c) obtaining a probe comprising a reactive group, a release group and a 
mass label; 

(d) hybridizing the amplified target molecule to the probe to produce a 
probe:amplified target molecule complex; 

(e) releasing the mass label from the probe:amplified target molecule 
complex to obtain a released mass label; and 

(f) determining the mass of the released mass label by the method of
claim 1.

121. The method of claim 120, wherein prior to step (f), the mass
label is mixed with the liquid matrix.

122. The method of claim 121, wherein the liquid matrix is glycerol.

123. A method for detecting a target molecule, comprising: 

(a) obtaining a probe comprising a reactive group, a release group and a
mass label; 

(b) obtaining a target molecule; 

(c) contacting the target molecule with the probe to produce a 
probe:target molecule complex; 

(d) releasing the mass label from the probe:target molecule complex;
and

(e) determining the mass of the mass label by the method of claim 1.

124. The method of claim 123, wherein the mass label is nonvolatile 
and the mass label is selectively released from the probe:target molecule 
complex. 

125. The method of claim 123, wherein prior to step (e), the mass
label is mixed with a liquid matrix.

126. The method of claim 124, wherein the matrix is glycerol.

127. A method for multiplexing the detection of a target molecule,
comprising:

(a) obtaining a plurality of probes, each comprising a reactive group, a
release group and a mass label; 

(b) contacting the target molecule with the plurality of probes to
produce probe:target molecule complexes, wherein the target molecule is
attached to the reactive group of the probe;

(c) releasing the mass labels from the probe:target molecule complexes
to produce released mass labels; and

(d) determining the mass of the released mass labels by the method of
claim 1, wherein each reactive group in a probe:target molecule complex is
associated with a unique set of mass labels.

128. The method of claim 127, wherein prior to step (d), the mass
labels are mixed with a liquid matrix.

129. The method of claim 127, wherein the matrix is glycerol.

130. A method for multiplexing the detection of a plurality of target
molecules, comprising:

(a) obtaining a plurality of probes, each comprising a reactive group, a 
release group and a mass label; 

(b) contacting the plurality of target molecules with the plurality of
probes to produce probe:target molecule complexes, wherein target molecules
are attached to the reactive groups of the probes;

(c) releasing the mass labels from the probe:target molecule complexes
to produce released mass labels; and

(d) determining the mass of the released mass labels by the method of
claim 1, wherein each reactive group specific for a particular target molecule is
associated with a unique mass label.
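
Because each target-specific reactive group carries a unique mass label, the released labels can in principle be decoded by a simple lookup. The following is a minimal sketch under assumed inputs: the label masses, the tolerance and the function name `assign_labels` are all hypothetical and serve only to illustrate the idea.

```python
from typing import Dict, List, Optional


def assign_labels(
    observed: List[float],
    label_masses: Dict[str, float],
    tol: float = 0.5,
) -> Dict[float, Optional[str]]:
    """Match each observed mass to the nearest known label within tol.

    label_masses maps a target name to the expected mass of its label.
    Observed masses with no label within tol are mapped to None.
    """
    result: Dict[float, Optional[str]] = {}
    for mass in observed:
        best: Optional[str] = None
        best_err = tol
        for target, expected in label_masses.items():
            err = abs(mass - expected)
            if err <= best_err:
                best, best_err = target, err
        result[mass] = best
    return result


labels = {"target_1": 500.0, "target_2": 650.0}  # hypothetical label masses
print(assign_labels([500.2, 649.8, 800.0], labels))
```

In such a scheme the label masses would need to be spaced further apart than the chosen tolerance so that each observed peak maps to at most one target.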

131. The method of claim 130, wherein prior to step (d), the mass
labels are mixed with a liquid matrix.

132. The method of claim 131, wherein the matrix is glycerol.

133. A process for detecting a target biological macromolecule,
comprising the steps of:

a) preparing a mixture, comprising a biological macromolecule
and a liquid matrix, which absorbs infrared radiation; and

b) performing IR-MALDI mass spectrometry on the mixture to
identify the target biological macromolecule in the mixture, thereby 
detecting the target biological macromolecule. 

134. The process of claim 133, wherein the target biological
macromolecule is in a biological sample, whereby detection of the target
biological macromolecule identifies the presence of the target biological
macromolecule in the biological sample.

135. The process of claim 133, wherein the biological macromolecule
is a nucleic acid.

136. The process of claim 133, wherein the biological macromolecule
is a polypeptide.

137. The process of claim 133, wherein the biological macromolecule 
is selected from the group consisting of a carbohydrate, lipid, a nucleoprotein, 
a proteoglycan, and a macromolecular complex.

138. The process of claim 133, wherein the target biological
macromolecule is immobilized on a solid support.

139. The process of claim 133, wherein the target biological
macromolecule is immobilized to the solid support via a reversible linkage.

140. The process of claim 133, wherein the target biological
macromolecule is immobilized to the solid support via a photocleavable bond.

141. The process of claim 138, wherein the reversible linkage is a thiol
linkage or an ionic bond.

142. The process of claim 138, wherein the target biological
macromolecule is cleaved from the support during the step of performing
IR-MALDI mass spectrometry.

143. The process of claim 138, wherein the solid support is selected
from the group consisting of a bead, a flat surface, a chip, a capillary, a pin, a
comb, a wafer, a wafer with an array of nano-wells or pits, the terminus of a
fiber optic cable, a support with a surface that comprises hydrophobic regions
and hydrophilic regions, whereby the target molecule is constrained to a locus
on the support.

144. The process of claim 138, wherein the solid support is a material
selected from the group consisting of a metal, a ceramic, a plastic, a resin, a 
gel, and a membrane. 

145. The process of claim 138, wherein the support is a silicon wafer, 
and wherein the target biological macromolecule is immobilized in an array. 

146. The process of claim 135, wherein a target nucleic acid is 
immobilized by hybridization to a complementary capture nucleic acid molecule, 
which is immobilized to a solid support. 

147. The process of claim 133, wherein the target biological 
macromolecule is conditioned prior to the step of performing IR-MALDI mass 
spectrometry. 

148. The process of claim 147, wherein the target biological 
macromolecule is conditioned by ion exchange. 

149. The process of claim 135, wherein the target nucleic acid is 
conditioned by a method selected from the group consisting of phosphodiester 
backbone modification effected by cation exchange; contact with an alkylating 
agent or trialkylsilyl chloride; incorporation of at least one nucleotide that 
reduces sensitivity for depurination in the target nucleic acid; incorporation of 
at least one mass modified nucleotide in the target nucleic acid; hybridization of 
a tag probe to a portion of a nucleic acid molecule that contains the target 
nucleic acid but is distinct from a target nucleic acid sequence. 

150. The process of claim 136, wherein the target polypeptide is
obtained by in vitro translation, or by in vitro transcription followed by
translation, of a nucleic acid encoding the target polypeptide.

151. The process of claim 150, wherein the nucleic acid encoding the
target polypeptide further comprises a nucleotide sequence encoding a second
polypeptide.

152. The process of claim 136, wherein the target polypeptide
comprises a tag.

153. A process for detecting the presence of a target nucleic acid
sequence in a biological sample containing nucleic acid molecules, comprising
the steps of:

a) contacting the nucleic acid molecules with a detector
oligonucleotide, which can hybridize to a target nucleic acid sequence 
present in the biological sample; 

b) preparing a mixture for IR-MALDI, comprising the product of 
step a) and a liquid matrix, which absorbs infrared radiation; and 

c) identifying duplex nucleic acid molecules in the mixture by 
IR-MALDI mass spectrometry, thereby detecting the presence of the target 
nucleic acid sequence in the biological sample. 

154. The process of claim 153, further comprising, prior to step a), 
a step of amplifying the nucleic acid molecules in the biological sample. 

155. A process for detecting the presence of a target nucleic acid 
sequence in a biological sample containing nucleic acid molecules, comprising 
the steps of: 

a) specifically digesting the nucleic acid molecules using at least 
one appropriate nuclease, thereby producing digested fragments; 

b) hybridizing the digested fragments with complementary 
capture nucleic acid sequences, which are immobilized on a solid 
support and can hybridize to a digested fragment of a target nucleic acid 
to produce immobilized fragments; 

c) preparing a mixture for IR-MALDI, comprising the immobilized 
fragments and a liquid matrix, which absorbs infrared radiation; and 

d) identifying immobilized fragments by IR-MALDI mass 
spectrometry, thereby detecting the presence of the target nucleic acid 
sequence in the biological sample. 

156. The process of claim 155, further comprising, prior to step a), a 
step of amplifying the nucleic acid molecules in the biological sample. 

157. A process for detecting a target nucleic acid sequence, 
comprising the steps of: 

a) performing at least one hybridization on a nucleic acid 
molecule containing the target nucleic acid sequence with a set of 
ligation educts and a thermostable DNA ligase; 

b) preparing a mixture for IR-MALDI, comprising the product of 
step a) and a liquid matrix, which absorbs infrared radiation; and 

c) identifying a ligation product in the mixture by IR-MALDI mass 
spectrometry, thereby detecting the target nucleic acid sequence. 

158. A process for detecting the presence of a target nucleic acid in a 
biological sample containing nucleic acid molecules, comprising the steps of: 

a) performing on the nucleic acid molecules, a first polymerase 
chain reaction using a first set of primers, which are capable of 
amplifying a portion of a nucleic acid molecule containing the target 
nucleic acid, thereby producing a first amplification product; 

b) preparing a mixture for IR-MALDI, comprising the first 
amplification product and a liquid matrix, which absorbs infrared 
radiation; and 

c) detecting the first amplification product in the mixture by 
IR-MALDI mass spectrometry, thereby detecting the presence of the 
target nucleic acid in the biological sample. 

159. The process of claim 158, wherein prior to step b), a second 
polymerase chain reaction is performed on the first amplification product using 
a second set of primers, which are capable of amplifying at least a portion of 
the first amplification product, which contains the target nucleic acid. 

160. A process for determining the identity of a target nucleotide, 
comprising the steps of: 

a) hybridizing a nucleic acid molecule containing the target 
nucleotide with a primer oligonucleotide that is complementary to the 
nucleic acid molecule at a site adjacent to the target nucleotide, thereby 
producing a hybridized nucleic acid molecule; 

b) contacting the hybridized nucleic acid molecule with a 
complete set of dideoxynucleosides or 3'-deoxynucleoside triphosphates 
and a DNA dependent DNA polymerase, so that only the 
dideoxynucleoside or 3'-deoxynucleoside triphosphate that is 
complementary to the target nucleotide is extended onto the primer, 
thereby producing an extended primer; 




c) preparing a mixture for IR-MALDI, comprising the extended 
primer and a liquid matrix, which absorbs infrared radiation; and 

d) detecting the extended primer in the mixture by IR-MALDI 
mass spectrometry, thereby determining the identity of the target 
nucleotide. 

161. A process for detecting the presence or absence of a mutation in 
a target nucleic acid sequence, comprising the steps of: 

a) hybridizing a nucleic acid molecule containing the target 
nucleic acid sequence with at least one primer, the primer having 
3' terminal base complementarity to the target nucleic acid sequence, 
thereby producing a hybridized product; 

b) contacting the hybridized product with an appropriate 
polymerase enzyme and sequentially with one of the four nucleoside 
triphosphates; 

c) preparing a mixture for IR-MALDI, comprising the product of 
step b) and a liquid matrix, which absorbs infrared radiation; and 

d) detecting the product of step b) in the mixture by IR-MALDI 
mass spectrometry, wherein the molecular weight of the product 
indicates the presence or absence of a mutation next to the 3' end of 
the primer in the target nucleic acid molecule. 

162. The process of claim 161, wherein the nucleic acid molecule 
containing the target nucleic acid sequence is immobilized to a solid support. 

163. The process of claim 161, wherein, prior to step a), the nucleic 
acid molecule containing the target nucleic acid sequence is amplified. 

164. A process for detecting a mutation in a target nucleic acid, 
comprising the steps of: 

a) hybridizing the target nucleic acid with an oligonucleotide 
probe, to produce a hybridized target nucleic acid, wherein a mismatch 
is formed at the site of a mutation; 

b) contacting the hybridized target nucleic acid with a single 
strand specific endonuclease; 

c) preparing a mixture for IR-MALDI, comprising the product of 
step b) and a liquid matrix, which absorbs infrared radiation; and 

d) analyzing the mixture by IR-MALDI mass spectrometry, 
wherein the presence of more than one fragment of the target nucleic 
acid in the mixture indicates a mutation in the target nucleic acid. 

165. A process for identifying the absence or presence of a mutation 
in a target nucleic acid sequence, comprising the steps of: 

a) performing at least one hybridization on a nucleic acid 
molecule containing the target nucleic acid sequence with a set of 
ligation educts and a DNA ligase; 

b) preparing a mixture for IR-MALDI, comprising the product of 
step a) and a liquid matrix, which absorbs infrared radiation; and 

c) analyzing the mixture by IR-MALDI mass spectrometry, 
wherein detecting a ligation product in the mixture identifies the 
absence of a mutation in the target nucleic acid sequence, and 
wherein detecting only the set of ligation educts in the mixture identifies 
the presence of a mutation in the target nucleic acid sequence. 

166. A process for determining the identity of each target biological 
macromolecule in a plurality of target biological macromolecules, comprising 
the steps of: 

a) preparing a mixture for IR-MALDI, comprising a plurality of 
differentially mass modified target biological macromolecules and a 
liquid matrix, which absorbs infrared radiation; 

b) determining the molecular mass of each differentially mass 
modified target biological macromolecule in the plurality by IR-MALDI 
mass spectrometry; and 

c) comparing the molecular mass of each differentially mass 
modified target biological macromolecule in the plurality with the 
molecular mass of a corresponding known biological macromolecule, 
thereby determining the identity of each target biological macromolecule 
in the plurality of target biological macromolecules or fragments thereof. 

167. The process of claim 166, wherein each target biological 
macromolecule in the plurality is a fragment of a biological macromolecule, 
each fragment prepared by contacting the biological macromolecule with at 
least one agent that cleaves a bond involved in the formation of the biological 
macromolecule. 

168. A process for determining the sequence of a target biological 
macromolecule, comprising the steps of: 

a) generating at least two biological macromolecule fragments 
from the target biological macromolecule; 

b) preparing a mixture for IR-MALDI, comprising the biological 
macromolecule fragments and a liquid matrix, which absorbs infrared 
radiation; and 

c) analyzing the biological macromolecule fragments in the 
mixture by IR-MALDI mass spectrometry, thereby determining the 
sequence of the target biological macromolecule. 

169. A process of determining the subunit sequence of at least one 
species of target biological macromolecule, i, comprising the steps of: 

a) contacting the species of target biological macromolecule with 
at least one agent that cleaves a bond involved in the formation of the 
target biological macromolecule such that each bond involved in the 
formation of the target biological macromolecule is cleaved, thereby 
producing a nested set of deletion fragments of the species of biological 
macromolecule; 

b) preparing a mixture for IR-MALDI, comprising the nested set 
of deletion fragments and a liquid matrix, which absorbs infrared 
radiation; and 

c) determining the molecular mass of each deletion fragment in 
the mixture by IR-MALDI mass spectrometry, thereby determining the 
subunit sequence of the species of target biological macromolecule. 

170. The process of claim 169, wherein the agent that cleaves is an 
agent that cleaves the target biological macromolecule unilaterally from a 
terminus. 

171. The process of claim 170, wherein the target biological 
macromolecule is a nucleic acid and the agent that cleaves is an exonuclease. 

172. The process of claim 170, wherein the target biological 
macromolecule is a polypeptide and the agent that cleaves is an exopeptidase. 

173. The process of claim 170, wherein the at least one species of 
target biological macromolecule comprises i + 1 species of target biological 
macromolecules, and wherein each species of target biological macromolecule 
is differentially mass modified such that a deletion fragment of each species of 
target biological macromolecule can be distinguished from a deletion fragment 
of every other target biological macromolecule by IR-MALDI mass 
spectrometry. 

174. A process of determining the nucleotide sequence of at least one 
species of nucleic acid, i, comprising the steps of: 

a) synthesizing complementary nucleic acids, which are 
complementary to the species of nucleic acid to be sequenced, starting 
from an oligonucleotide primer and in the presence of chain terminating 
nucleoside triphosphates, thereby producing four sets of 
base-specifically terminated complementary polynucleotide fragments; 

b) preparing a mixture for IR-MALDI, comprising the four sets of 
polynucleotide fragments and a liquid matrix, which absorbs infrared 
radiation; and 

c) determining the molecular weight value of each polynucleotide 
fragment by IR-MALDI mass spectrometry; and 

d) determining the nucleotide sequence of the species of nucleic 
acid by aligning the molecular weight values according to molecular 
weight. 

175. The method of claim 173, wherein the species of nucleic acid is 
RNA, the chain terminating nucleoside triphosphates are ribonucleotide 
triphosphates or derivatives thereof, and the oligonucleotide primer is an 
initiator oligonucleotide. 

176. The method of claim 174, wherein i+1 species of nucleic acids 
are concurrently sequenced by multiplex mass spectrometric nucleic acid 
sequencing employing i + 1 primers, wherein one of the i + 1 primers is an 
unmodified primer or a mass modified primer and the other i primers are mass 
modified primers, and each of the i + 1 primers can be distinguished from the 
other by IR-MALDI mass spectrometry. 

177. A method for determining a sequence of a target nucleic acid, 
comprising the steps of: 

a) hybridizing at least one partially single stranded target nucleic 
acid to one or more nucleic acid probes, each probe comprising a double 
stranded portion, a single stranded portion, and a determinable variable 
sequence within the single stranded portion, thereby producing at least 
one hybridized target nucleic acid; 

b) preparing a mixture for IR-MALDI, comprising the hybridized 
target nucleic acid and a liquid matrix, which absorbs infrared radiation; 

c) determining a sequence of the hybridized target nucleic acid 
by IR-MALDI mass spectrometry based on the determinable variable 
sequence of the probe to which the target nucleic acid hybridized; and 

d) repeating steps a) to c) a sufficient number of times to 
determine a sequence of the target nucleic acid. 

178. The method of claim 170, wherein the one or more nucleic acid 
probes are immobilized in an array. 

179. The method of claim 170, wherein, prior to step b), the 
hybridized target nucleic acid is ligated to the determinable variable sequence. 

180. A method for analyzing biological macromolecules in a sample 
comprising: 

exposing the sample to infrared radiation; and 

analyzing the sample using matrix-assisted laser desorption/ionization 
mass spectrometry; 

wherein the accuracy of mass determination by matrix-assisted laser 
desorption/ionization is in the range of about 10^2 to about 5 x 10^3 ppm. 

181. The method of claim 180, wherein the accuracy of mass 
determination by matrix-assisted laser desorption/ionization is in the range of 
about 100 to about 500 ppm. 

182. The method of claim 180, wherein the accuracy of mass 
determination by matrix-assisted laser desorption/ionization is in the range of 
about 1 x 10^3 to about 5 x 10^3 ppm. 

183. The method of claim 180, wherein the molecular weight of the 
biological macromolecule is 150 kDa or less; and 

the precision of mass determination by matrix-assisted laser 
desorption/ionization is in the range of about 400-500 ppm. 

184. The method of claim 180, wherein the molecular weight of the 
biological macromolecule is 150 kDa or less; and 

the precision of mass determination by matrix-assisted laser 
desorption/ionization is in the range of about 200-400 ppm. 

185. The method of claim 180, wherein the molecular weight of the 
biological macromolecule exceeds 1 MDa (megadalton). 

186. The method of claim 180, further comprising extracting the ions 
from the ion source by delayed extraction, whereby enhanced mass resolution 
is achieved. 

187. The method of claim 180, wherein the matrix is glycerol. 

188. The method of claim 180, wherein the matrix is succinic acid. 

189. The method of claim 180, wherein the biological macromolecule 
is a polypeptide greater than 50 kDa and the matrix-assisted laser 
desorption/ionization mass spectrometry is performed using a reflectron time- 
of-flight format. 

190. The method of claim 180 that is a method for detecting the 
presence or absence of a nucleic acid in a sample, wherein: 

the sample is mixed with a matrix to form a homogeneous mixture with 
the nucleic acid; and 

the nucleic acid is detected if present in the sample. 

191. The method of claim 190, wherein the sample is a biological 
sample. 

192. The method of claim 190, wherein the nucleic acid comprises at 
least 2000 nucleotides. 

193. The method of claim 190, wherein the nucleic acid comprises at 
least 280 nucleotides. 

194. The method of claim 190, wherein the nucleic acid is DNA. 

195. The method of claim 190, wherein the nucleic acid comprises at 
least 1200 nucleotides. 

196. The method of claim 190, wherein the nucleic acid is RNA. 

197. The method of claim 190, wherein the presence or absence of 
the nucleic acid is indicative of the presence or absence of a genetic disease. 

198. The method of claim 190, wherein the presence or absence of 
the nucleic acid is indicative of the presence or absence of a birth defect. 

199. The method of claim 190, wherein the presence or absence of 
the nucleic acid is indicative of the presence or absence of an infectious 
organism. 

200. The method of claim 190, wherein the presence or absence of 
the nucleic acid is indicative of the identity of a subject. 

201. The method of claim 190, wherein the matrix is a substituted or 
unsubstituted alcohol. 

202. The method of claim 190, wherein the nucleic acid/matrix 
mixture is deposited onto fields of a chip array, arrays of pins or beads in pits 
of flat surfaces. 

203. A method for determining the presence or absence of a target 
biological macromolecule in a sample, comprising: 

analyzing the sample using infrared matrix-assisted laser 
desorption/ionization mass spectrometry; 

wherein the target biological macromolecule is detected if present in the 
sample. 

204. The method of claim 203, wherein the sample is a biological 
sample. 

205. The method of claim 203, wherein the sample contains one or 
more biological macromolecules other than the target biological macromolecule. 

206. The method of claim 203, wherein the target macromolecule is a 
biopolymer. 

207. The method of claim 206, wherein the target biopolymer is a 
polypeptide. 

208. The method of claim 206, wherein the target biological 
macromolecule is a nucleic acid. 

209. The method of claim 206, wherein the target nucleic acid is a 
double-stranded nucleic acid or the target nucleic acid is a single-stranded 
nucleic acid or the target nucleic acid comprises double-stranded and single- 
stranded regions. 

210. The method of claim 206, wherein the presence or absence of 
the target biological macromolecule is indicative of one or more of the 
following: 

the presence or absence of a genetic disease or is indicative of the 
presence or absence of a birth defect; 

the presence or absence of an infectious organism; 
the identity of a subject. 

211. The method of claim 203, wherein delayed ion extraction is used 
in the matrix-assisted laser desorption/ionization mass spectrometry. 

212. The method of claim 203, wherein the biological macromolecule 
is selected from the group consisting of a carbohydrate, a nucleoprotein, a 
proteoglycan, lipids, nucleic acid analogs and a macromolecular complex. 

213. The method of claim 203, wherein the target biological 
macromolecule is immobilized on a solid support. 

214. The method of claim 213, wherein the target biological 
macromolecule is immobilized to the solid support via a cleavable linkage and/or 
a reversible linkage. 

215. The method of claim 213, wherein the target biological 
macromolecule is cleaved from the support during the step of performing 
matrix-assisted laser desorption/ionization mass spectrometry. 

216. The method of claim 213, wherein the support comprises one or 
more hydrophilic areas and each of the one or more areas is surrounded by a 
hydrophobic area or one or more hydrophobic areas and each of the one or 
more areas is surrounded by a hydrophilic area or is the terminus of a fiber 
optic cable. 

217. The method of claim 203, wherein a target nucleic acid is 
immobilized by hybridization to a complementary capture nucleic acid molecule, 
which is immobilized to a solid support. 

218. The method of claim 203, wherein the target biological 
macromolecule is conditioned prior to the step of performing matrix-assisted 
laser desorption/ionization mass spectrometry. 

219. The method of claim 203, wherein the target biopolymer is 
conditioned by a method selected from the group consisting of ion exchange, 
phosphodiester backbone modification effected by cation exchange; contact 
with an alkylating agent or trialkylsilyl chloride; incorporation of at least one 
nucleotide that reduces sensitivity for depurination in the target nucleic acid; 
incorporation of at least one mass modified nucleotide in the target nucleic 
acid; hybridization of a tag probe to a portion of a nucleic acid molecule that 
contains the target nucleic acid but is distinct from a target nucleic acid 
sequence. 

220. The method of claim 208, wherein the target polypeptide is 
obtained by in vitro translation, or by in vitro transcription followed by 
translation, of a nucleic acid encoding the target polypeptide. 

221. The method of claim 203, wherein the nucleic acid encoding the 
target polypeptide further comprises a nucleotide sequence encoding a second 
polypeptide. 

222. The method of claim 208, wherein the target polypeptide 
comprises a tag. 

223. The method of claim 203, wherein: 

the sample comprises a matrix; the matrix is glycerol; and 

the biological macromolecule is a nucleic acid with a mass in the range 
from about 1x10" Daltons to about 1 x 10^6 Daltons. 

224. The method of claim 203, further comprising, prior to analyzing 
the sample using matrix-assisted laser desorption/ionization mass spectrometry, 
depositing the sample on a substrate. 

225. The method of claim 224, wherein said depositing is performed with an 
automated liquid dispensing device. 

226. The method of claim 203, wherein infrared matrix-assisted laser 
desorption/ionization mass spectrometry is performed at a temperature in the 
range of about -80°C to 20°C. 
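
As an illustrative aside to the ppm ranges recited in claims 180 to 184, the following minimal sketch shows how a relative mass error in parts per million can be computed and compared against such a range. It is not part of the claimed subject matter. The function names and the example masses are hypothetical, chosen only for illustration, and the standard definition of ppm error (absolute mass difference divided by the reference mass, times one million) is assumed.

```python
def mass_error_ppm(measured_da: float, reference_da: float) -> float:
    """Relative mass error in parts per million (ppm).

    Assumes the usual definition:
        |measured - reference| / reference * 1e6
    Both masses are in daltons.
    """
    if reference_da <= 0:
        raise ValueError("reference mass must be positive")
    return abs(measured_da - reference_da) / reference_da * 1e6


def within_ppm_range(error_ppm: float, low_ppm: float, high_ppm: float) -> bool:
    """True if the error lies in the closed interval [low_ppm, high_ppm]."""
    return low_ppm <= error_ppm <= high_ppm


# Hypothetical example: a 150 kDa analyte measured 60 Da too high.
reference = 150_000.0
measured = 150_060.0
error = mass_error_ppm(measured, reference)

# Compare against the range of about 10^2 to about 5 x 10^3 ppm.
print(f"{error:.1f} ppm, in range: {within_ppm_range(error, 1e2, 5e3)}")
```

In this hypothetical case a 60 Da deviation at 150 kDa corresponds to roughly 400 ppm, which is why ppm is a convenient scale-independent way to state accuracy across the wide mass range the claims cover.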